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INTRODUCTION 


Breast  cancer  accounts  for  approximately  30%  of  all  diagnosed  cancer  and  is  the  most 
common  malignancy  in  women.  The  lethality  of  breast  cancer  is  mainly  attributed  to  its 
ability  to  metastasize  throughout  the  body.  According  to  the  American  Society,  the  five- 
year  survival  rate  for  individuals  with  metastatic  breast  cancer  is  26%,  in  contrast  to  98% 
for  those  with  non-metastatic  breast  cancer.  Although  there  has  been  significant  progress 
made  in  the  identification  and  understanding  of  breast  cancer  over  recent  years,  there 
remains  a  need  for  the  identification  of  molecular  markers  that  definitively  distinguish 
poorly  invasive/non-metastatic  tumors  from  highly  invasive/metastatic  tumors.  To 
identify  such  prognostic/predictive  markers,  a  better  understanding  of  the  molecular 
events  leading  to  breast  cancer  progression  is  needed. 

We  have  identified  a  molecular  marker  that  is  a  candidate  breast  cancer  metastasis 
suppressor.  This  marker,  termed  Heterochromatin  Protein  1  (HPlHsn),  is  an 
evolutionarily  conserved  non-histone  chromosomal  protein  (9).  HP1  primarily  localizes 
to  centric  heterochromatin  where  it  plays  a  role  in  chromosome  segregation  and  silencing 
of  genes  brought  into  juxtaposition  with  heterochromatin  (9).  HP1  also  localizes  within 
the  gene-rich  euchromatic  regions  of  the  genome  where  it  is  plays  a  role  in  gene 
regulation  (2).  We  discovered  that  HPlHsn,  one  of  three  HP1  proteins  in  humans,  was 
significantly  down-regulated  in  highly  invasive/metastatic  breast  cancer  cells  compared 
with  poorly  invasive/non-metastatic  breast  cancer  cells  (6).  This  observation  was  specific 
for  HPlHsa,  and  not  the  other  two  HP1  family  members,  HPlHsP  and  HPlHsY.  We  also 
discovered  a  similar  down-regulation  of  HP1Hs“  levels  in  breast  cancer  patients:  HP1Hs" 
was  abundant  in  the  nuclei  of  cells  from  primary  breast  tumors,  but  dramatically  reduced 
in  cells  of  metastatic  tissues  (6).  Given  the  role  of  HP1  proteins  in  gene  regulation,  we 
hypothesize  HP lHsct  alters  the  expression  of  genes  involved  in  invasion/metastasis. 

BODY 

Progress  made  within  the  past  year  on  each  task  is  described  below.  All  tasks  are 
identical  to  those  stated  in  the  approved  STATEMENT  OF  WORK  in  the  original 
proposal. 

Task  1:  Determine  the  consequences  of  HPlHsa  expression  on  tumor  metastasis 
markers  and  global  transcriptional  expression  in  human  breast  cancer  cells. 

This  task  is  based  on  the  observation  that  poorly  invasive/non-metastatic  breast  cancer 
cells  (MCF-7)  have  levels  of  HPlHsa  similar  to  most  cell  types,  whereas,  highly 
invasive/metastatic  cells  (MDA-MB-231)  have  low  levels  HPlHsa  (6).  Given  the  role  of 
HpiHsa  in  ggne  regUiation  (9)5  we  hypothesize  that  alterations  in  the  levels  of  HP1Hs" 
result  in  changes  in  gene  expression.  We  have  experimentally  modulated  the  levels  of 
HPlHsctin  both  MCF-7  and  MDA-MB-231  cells  and  assayed  for  changes  in  gene 
expression.  An  adenovirus  delivery  system  was  utilized  to  increase  levels  of  HPlHsain 
MDA-MB-231  cells.  This  viral  delivery  system  was  used  due  to  our  inability  to  isolate 
MDA-MB-23 1  cells  that  stably  expressed  an  EGFP-HPlHsa  transgene.  In  the  last  grant 
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period  we  demonstrated  that  expression  of  EGFP-HPlHs“  in  MDA-MB-231  reduced  in 
vitro  invasion  by  30%  relative  to  control  cells  expressing  a  nuclear  tagged  EGFP  (NLS- 
GFP)  (Figure  1).  Expression  of  a  mutant  form  of  HPlHstl  containing  an  amino  acid 
substitution  in  the  HPlHsachromo  shadow  domain  (I165E)  that  disrupts  HP1  dimerization 
did  not  alter  the  invasive  phenotype.  [See  Task  3  below  for  more  information  of 
HP1Hs"  domain  structure.]  In  contrast,  expression  of  a  mutant  form  of  HP lHs"  carrying  an 
amino  acid  substitution  in  the  HPlHsachromo  shadow  domain  (W174A)  that  disrupts 
interactions  with  known  protein  partners  containing  a  PxVxL  motif  reduced  invasion  to 
levels  similar  to  those  obtained  following  expression  of  EGFP-HP1Hs“.  Taken  these  data 
together,  we  conclude  that  increased  levels  of  HPlHsnand  dimerization  of  HP  lHs“  cause 
reduced  invasion  potential  in  MDA-MB-231  cells. 
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Figure  1.  Expression  of  EGFP- 
jjpjHsa  re(juces  invasion  of  MDA- 
MB-231  cells.  Cells  were  infected 
with  adenovirus  expressing  either 
wild  type  or  mutant  forms  of 
EGFP-  HPlHsa .  Results  were 
compared  to  cells  expressing  an 
EGFP-NLS  control.  Invasion 
assays  were  performed  according  to 
published  procedures  (3). 


In  this  grant  cycle  we  tested  for  effects  of  HPlHsaon  expression  of  candidate  HP1- 
regulted  genes.  Candidate  genes  included  those  involved  in  cancer  cell  invasion  and 
metastasis  as  well  as  known  HP  1 -target  genes.  RNA  from  MDA-MB-231  cells  infected 
with  adenovirus  expressing  EGFP-  HPlHsa,  the  I165E  mutant  form  of  HPlHsaor  the 
W174A  mutant  form  of  HPlHsawas  used  for  gene  expression  studies.  The  results  were 
compared  with  those  obtained  from  RNA  isolated  from  cells  expressing  the  NLS-GFP 
control  and  uninfected  cells.  Gene  expression  was  measured  by  semi-quantitative  RT- 
PCR  and  real-time  PCR.  Eight  genes  known  to  be  associated  with  HP1  did  not  change  in 
expression  upon  introduction  of  wild  type  or  mutant  forms  of  HP1Hs"  relative  to  control 
cells.  Even  though  these  genes  have  been  shown  to  be  associated  with  HPlHsa,  the  lack 
of  change  in  MDA-MB-231  cells  could  be  due  to  cell  type  specificity  for  HPlHsa 
association  and/or  compensation  by  other  HP1  family  members.  Five  tumor  suppressor 
genes,  including  BRCA1,  were  investigated  for  changes  in  gene  expression,  but  none  of 
these  genes  changed  in  MDA-MB-231  cells  upon  introduction  of  HPlHsn  relative  to 
control  cells.  Twenty  metastasis-associated  genes,  including  ten  integrin  genes  involved 
in  cell-cell  and  cell-extrcellular  matrix  interactions,  and  four  matrix  metalloproteases, 
were  tested  for  alterations  in  gene  expression.  One  metastasis-associated  gene,  integrin 
a3  ( ITGA3 )  showed  altered  expression  upon  changes  in  HPlHsn dosage  (Figure  2). 
Expression  of  HPlHsctand  the  W174A  mutant  form  of  HP lHsn resulted  a  3  to  6-fold  up- 
regulation  of  ITGA3  expression  compared  to  uninfected  MDA-MB-231  control  cells. 
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Figure  2.  ITGA3  is  up- 
regulated  in  response  to 
increased  levels  of 
HPlHsa  in  MDA-MB-231  cells. 

Top  represents  a  histogram  of 
ITGA  expression  under  each 
condition.  The  results  from  three 
independent  samples  are  shown. 
Below  are  the  semi-quantitative 
RT-PCR  results  for  one  sample. 


Expression  of  the  nuclear-tagged  EGFP  and  the  I165E  mutant  form  of  HP1  did  not  affect 
ITGA3  expression  compared  to  control  cells.  Therefore,  ITGA3  is  up-regulated  upon 
increased  levels  of  HPlHsn  in  MDA-MB-231  cells.  1TGA3  expression  has  been  correlated 
with  tumor  and  metastasis  suppression  (ref),  consistent  with  our  model  of  HP1  as  a 
metastasis  suppressor. 


As  a  second  approach  to  modulating  HPlHsa levels  in  breast  cancer  cells,  we  knock-down 
levels  of  HP lHs"  using  two  different  RNAi  methods.  First,  HP1Hsci  levels  were  knocked- 
down  using  an  adenovirus  expressing  an  shRNAs  against  HP1Hs".  We  use  adenovirus 
delivery  when  large  numbers  of  cells  are  required  for  down-stream  analysis,  such  as  in 
vitro  invasion  assays  (3).  An  adenovirus  expressing  shRNA  to  GFP  was  used  as  a  non¬ 
specific  control  to  rule  out  possible  effects  due  to  viral  infection.  Second,  HP lHsct  levels 
were  knocked-down  using  small  siRNAs  purchased  from  Dharmacon.  While  the 
purchased  siRNAs  are  expensive  and  not  economical  for  large-scale  experiments,  the  use 
of  synthetic  dsRNAs  eliminates  viral  effects.  Hence,  the  siRNAs  were  used  for  the 
global  gene  expression  studies. 


HP1H“ 


0-tubulin 


HP1HsP 


HP1H,, 


uninfected  shHP1H“  shGFP 


Figure  3.  Knock-down  of 
HP1Hs“  using  shRNAs  expressed 
from  an  adenovirus  construct. 

An  adenovirus  expressing  a 
shRNA  against  GFP  was  used  as 
a  negative  control.  Knock-down 
was  specific  for  HPlHsa .  Western 
analysis  was  performed  72  hours 
post  infection. 
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Figure  4.  Knock-down  of  causes 
increased  invasion  of  MCF-7 
cells.  Cells  infected  with  an 
adenovirus  expressed  shGFP  serve 
as  controls. 


Using  shRNAs  we  obtained  greater  than  95%  knock-down  of  HP1  in  MCF-7  cells  as 
evidenced  by  western  analysis  (Figure  3).  Knock-down  was  specific  for  HPlHsa;  levels  of 
HPlHspand  HP1Hsy  did  not  dramatically  change  in  the  HP  lHsa  knock-down  cells  relative  to 
controls  (Figure  3). 


To  identify  changes  in  gene  expression  that  might  account  for  the  reduction  in  in  vitro 
invasion  upon  expression  of  HPlHsa,  we  performed  microarray  analysis  of  RNA  isolated 
from  HP1Hs"  knock-down  cells  and  cells  transfected  with  a  control  scrambled  dsRNA 
(Ambion).  Microarray  analysis  has  been  performed  on  three  independent  experimental 
and  control  RNA  samples  and  the  results  are  currently  being  analyzed  using  Microsoft 
Suite  and  R.  Approximately  200  genes  show  a  2-fold  increase  in  expression  upon  knock¬ 
down  of  HPlHs“,  and  approximately  200  genes  show  a  decrease  in  expression.  These 
numbers  are  consistent  with  the  data  we  obtained  from  gene  expression  studies  of  HP1 
mutants  in  Drosophila  (2).  A  list  of  the  top  10  genes  that  increase  in  the  HP  lHsa  knock¬ 
down  cells  relative  to  control  cells  is  shown  below  (Table  1).  We  are  in  the  process  of 
performing  statistical  calculations  on  the  microarray  data,  perform  gene  ontology  analysis 
on  the  genes  that  change  expression,  and  confirming  changes  in  gene  expression  by  RT- 
PCR.  The  data  will  provide  insights  on  the  molecular  pathways  that  lead  to 
invasion/metastasis. 


untransfected  dsHP1Hsa  non-specific  dsRNA 


Figure  5.  Knock-down  of 
using  dsRNA.  Knock-down 
is  specific  for  HPlHsa;  levels 
of  HPlHsp and  HPlHsYarenot 
dramatically  altered.  Western 
analysis  was  performed  72 
hours  after  transfection  with 
dsRNA. 
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Task  2:  Determine  the  molecular  mechanisms  of  HPlHsa  down-regulation  in  human 
breast  cancer  invasion/metastasis. 

This  task  was  completed  and  published  during  the  previous  year. 

Task  3:  Determine  the  domains  of  HP1Hs“  required  for  invasion  and  metastasis. 

HP1Hs"  has  a  two-domain  structure  consisting  of  an  amino  chromo  domain  (CD)  and  a 
carboxy  chromo  shadow  domain  (CSD).  The  CD  associates  with  methylated  lysine  9  of 
histone  H3  (5)  and  is  thought  to  be  the  primary  mechanism  of  HP1  localization  within 
centric  heterochromatin.  The  CSD  homodimerizes  (1);  this  dimerization  generates  a 
surface  that  interacts  with  a  variety  of  proteins  possessing  a  PxVxL  pentapeptide  motif 
(8,  10).  In  order  to  determine  the  domains  of  HPlHs"  involved  in  metastasis,  we 
generated  two  mutant  forms  of  HPlHs“.  The  first  is  an  amino  acid  substitution  in  the  a- 
helical  region  of  the  CD  that  disrupts  the  dimerization  of  HP1  (7).  The  second  is  an 
amino  acid  substitution  within  the  CSD  (W174A)  that  disrupts  interactions  with  the 
pentapeptide  motif-containing  proteins  (10).  During  the  previous  grant  period,  we 
demonstrated  that  expression  of  wild  type  HPlHsn  correlated  with  a  30%  reduction  in  in 
vitro  invasion  relative  to  controls  (Figure  1).  A  30%  reduction  was  also  observed  for  the 
W174A  mutation,  suggesting  that  interactions  with  penta-peptide  motif  containing 
proteins  do  not  play  a  role  in  regulating  invasion.  In  contrast,  there  was  no  change  in 
invasion  following  expression  of  the  I165E,  which  inhibits  dimerization  relative  to  the 
control  cells  (Figure  1). 

In  this  grant  cycle  we  have  addressed  the  effects  of  these  mutants  on  candidate  gene 
expression.  ITGA3  expression  is  altered  upon  introduction  of  wild  type  HPlHsaand  the 
W174A  mutant,  but  not  the  I165E  mutant  (Figure  2).  This  data  is  consistent  with  the 
changes  observed  for  in  vitro  invasion,  where  dimerization  appears  to  be  critical  for 
decreasing  invasion  potential  (Figure  1  and  text  of  Task  1).  ITGA3  gene  expression  has 
been  correlated  with  tumor  and  metastasis  suppression  (4).  We  are  currently  verifying 
the  changes  in  1TGA3  by  real-time  PCR  and  performing  immunostaining  to  examine 
ITGA3  protein  levels. 


KEY  RESEARCH  ACCOMPLISHEMENTS 

Obtained  greater  than  95%  knock-down  of  HPlHsa  using  adenoviral  vectors 
expressing  shRNAs  directed  against  HP1Hs". 

Demonstrated  that  MCF-7  cells  expressing  the  shRNA  against  HPlHsahave  a  40% 
increased  invasion  potential  than  control  shRNA  expressing  cells. 

Obtained  greater  than  95%  knock-down  of  HP1  in  MCF-7  cells  using  short 
synthetic  siRNAs. 

Demonstrated  that  knock-down  of  HPlHsain  MCF-7  cells  does  not  dramatically 
alter  the  levels  of  HPlHsf5and  HPlHsn. 
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Demonstrated  that  alterations  in  HP1Hs“  levels  within  MDA-MB-231  cells  and 
MCF-7  cells  does  not  alter  their  growth  rate  (data  not  shown),  supporting 
HPlHsaas  a  metastasis  suppressor,  rather  than  a  tumor  suppressor. 

Demonstrated  that  increased  expression  of  HPlHs“in  MDA-MB-231  cells  alters 
expression  of  INTGA3. 

Performed  microarray  analysis  of  MCF-7  with  reduced  levels  of  HPlHsa. 
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During  the  past  year  we  have  completed  all  of  the  tasks  described  in  the  original 
proposal.  Our  findings  indicate  that  altered  levels  of  HPlHsain  breast  cancer  cells  leads  to 
changes  in  invasion  and  gene  expression.  We  have  requested  a  “no  cost  extension”  to 
cover  the  expenses  associated  with  the  following:  (1)  verification  of  the  microarray 
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results  by  a  second  assay  such  as  RT-PCR,  (2)  publication  of  a  second  manuscript  on  the 
new  findings  reported  above  (targeted  for  publication  in  Cancer  Research)  and  (3)  travel 
and  lodging  for  the  2005  Era  of  Hope  meeting  (see  Abstracts  below).  Our  findings  are 
consistent  with  the  idea  that  HPlHsa  is  one  of  a  small  class  of  genes  known  as  metastasis 
suppressor.  Our  research  will  be  of  great  interest  to  investigators  that  work  in  the  areas 
of  breast  cancer  metastasis,  chromatin  structure  and  gene  expression. 
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Abstract 

Heterochromatin  protein  1  Hsa  (HP1Hs“)  is  one  of  three  human  proteins  that  share  sequence  similarity  with  Drosophila  HP1.  HP1 
proteins  are  enriched  at  centric  heterochromatin  and  play  a  role  in  chromatin  packaging  and  gene  regulation.  In  humans,  HPIHsct  is  down- 
regulated  in  highly  invasive/metastatic  breast  cancer  cells,  compared  to  poorly  invasive/non-metastatic  breast  cancer  cells.  To  gain  insight 
into  this  differential  regulation,  we  have  cloned  the  HPIHm  gene  and  characterized  its  genomic  region.  HPlHsCL  is  located  on  human 
chromosome  12q  13.13,  589  bp  upstream  of  the  divergently  transcribed  hnRNPAl  gene.  Analysis  of  the  promoter  region  revealed  that 
differential  regulation  of  PIP l Hs*  between  the  two  types  of  breast  cancer  cells  is  lost  upon  mutation  of  an  USF/c-myc  transcription  factor 
binding  site  located  1 72  bp  upstream  of  the  predicted  HP1h's,x  transcription  start  site.  These  findings  provide  insights  into  the  down-regulation 
of  HPlHsa  in  highly  invasive/metastatic  breast  cancer  cells.  To  examine  the  functional  properties  of  HPlHsa,  experiments  were  performed 
using  Drosophila  melanogaster  as  a  genetic  system.  When  human  HP1Hs“  was  expressed  in  transgenic  Drosophila,  silencing  of  reporter 
genes  inserted  at  centric  and  telomeric  locations  was  enhanced.  Furthermore,  expression  of  HP1Hs"  rescued  the  lethality  of  homozygous 
Su(var)2-5  mutants  lacking  HP1.  Taken  together,  these  results  demonstrate  the  participation  of  T IP  1 1  f  '"  in  silent  chromatin  formation  and  that 
HPlHsa  is  a  functional  homologue  of  Drosophila  HP1. 

©  2004  Elsevier  B.V.  All  rights  reserved. 

Keywords:  Breast  cancer  metastasis;  Drosophila;  Gene  silencing;  Heterochromatin 


1.  Introduction 

Heterochromatin  protein  1  (HP1)  was  first  discovered  in 
Drosophila  melanogaster  and  has  since  been  found  in  a 
variety  of  eukaryotes  from  Schizosaccharomyces  pombe  to 
humans  (Eissenberg  and  Elgin,  2000).  Drosophila,  mice  and 
humans  have  three  HP1  family  members.  All  HP1  proteins 


Abbreviations:  bp,  base  pair(s);  kb,  kilobase(s);  HP1,  heterochromatin 
protein  I;  CD,  chromo  domain;  CSD,  chromo  shadow  domain;  CAF1, 
chromatin  assembly  factor  1;  pg,  microgram;  EGFP,  enhanced  green 
fluorescent  protein;  PCR,  polymerase  chain  reaction;  5'  RACE,  5'  rapid 
amplification  of  cDNA  ends;  CMV,  cytolomegalovirus;  USF,  upstream 
stimulatory  factor. 
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have  a  conserved  amino  domain  termed  the  chromo  domain 
(CD)  and  a  carboxy  domain  termed  the  chromo  shadow 
domain  (CSD),  separated  by  a  less  conserved  hinge  region 
(Eissenberg  and  Elgin,  2000).  The  HP1  CD  binds  to 
methylated  lysine  nine  of  histone  H3;  this  interaction  is 
important  for  HP1  localization  at  centric  regions  of  chro¬ 
mosomes  (Brehm  et  al.,  2004). 

The  HP1  CSD  homodimerizes,  forming  a  site  of 
interaction  for  several  nuclear  proteins  possessing  a 
pentapeptide  motif  (PxVxL),  including  the  HP1  CSD 
itself  (Cowieson  et  al.,  2000;  Smothers  and  Henikoff, 
2000).  The  hinge  region  of  some  HP1  family  members 
interacts  with  RNA  and  histone  HI  (Nielsen  et  al.,  2001; 
Muchardt  et  al.,  2002).  Thus,  HP1  can  be  thought  of  as  a 
bridging  molecule  that  links  various  nuclear  proteins  to 
the  chromosome. 
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The  known  functions  of  HP1  proteins  are  largely  based 
on  genetic  data.  HP1  was  identified  in  Drosophila  mutagen¬ 
esis  screens  for  modifiers  of  heterochromatic  gene  silencing 
(Weiler  and  Wakimoto,  1995).  Mutations  in  the  Drosophila 
gene  encoding  HP1,  Su(var)2-5,  are  homozygous  lethal; 
heterozygotes  show  suppression  of  silencing  of  genes 
placed  near  heterochromatin,  implying  a  role  for  HP1  in 
chromatin  packaging.  In  S.  pombe,  mutations  in  the  HP1- 
like  protein  Swi6  lead  to  chromosome  segregation  defects 
(Ekwall  et  ah,  1995).  Information  gleaned  from  studies  of 
HP1  in  model  organisms  allows  one  to  infer  the  function  of 
HP1  homologs  in  mammalian  cells,  where  genetic  assays 
are  not  currently  available. 

In  humans,  HPlHsa  is  specifically  down-regulated  in 
highly  invasive/metastatic  breast  cancer  cells  compared  with 
poorly  invasive/non-metastatic  breast  cancer  cells,  both  at 
mRNA  and  protein  levels  (Kirschmann  et  ah,  2000).  Consis¬ 
tent  with  these  cell  culture  phenotypes,  staining  of  breast 
cancer  tissue  samples  with  antibodies  to  HP1Hs“  showed  that 
HP1Hs“  was  decreased  in  distant  metastases  compared  to 
primary  tumor  tissues  (Kirschmann  et  ah,  2000).  In  this  study, 
we  identify  sequences  within  the  HP1Hsx  promoter  region  that 
are  responsible  for  differential  expression  in  metastatic  vs. 
nonmetastatic  breast  cancer  cell  lines.  In  addition,  we  show 
here  that  HPlHsa  is  a  functional  homolog  of  Drosophila  HP1. 

2.  Materials  and  methods 

2.1.  Isolation  of  genomic  clones 

A  lambda  genomic  library  made  from  the  whole  placenta 
of  a  27-year-old  healthy  Caucasian  female  (Clontech)  was 
screened  to  recover  HP1Hs*  genomic  clones.  1.5  x  106 
independent  clones  were  screened  using  standard  techni¬ 
ques.  HP1Hsx  cDNA  was  random  prime  labeled  with  32P- 
dATP  (Amersham  Multiprime  labeling  kit)  and  used  for 
hybridization.  Southern  analysis  was  used  to  identify  frag¬ 
ments  within  the  clones  containing  HPl"sy.  These  frag¬ 
ments  were  isolated  and  sequenced  to  determine  intron/exon 
boundaries  of  the  HPlHsa  gene. 

2.2.  5'  RACE 

RNA  was  isolated  from  MCF-7  cells  using  TRIzol 
reagent  (Life  Technologies).  RNA  was  amplified  according 
to  the  5'  RACE  System  (Life  Technologies)  using  a  primer 
specific  to  a  region  34-59  bp  downstream  of  the  stop 
codon.  cDNA  was  amplified  at  50°C  instead  of  the  typical 
42°C  to  minimize  secondary  structure.  The  cDNA  was  PCR 
amplified  using  the  abridged  anchor  primer  (Life  Technol¬ 
ogies)  specific  to  the  C-tailed  cDNA  and  a  primer  specific  to 
the  fifth  exon.  An  additional  extension  cycle  at  72 °C  for  3 
min  was  added  at  the  end  of  the  PCR  cycles.  The  PCR 
products  were  cloned  (TA  cloning  system,  Invitrogen)  and 
sequenced. 


2.3.  Cells  and  culture  conditions 

MCF-7  cells  were  kindly  supplied  by  Dr.  F.  Miller 
(Karmanos  Cancer  Institute,  Detroit,  MI).  MDA-MB-231 
cells  were  obtained  from  the  American  Type  Culture  Col¬ 
lection  (ATCC,  Manassas,  VA).  Cell  lines  were  maintained 
as  previously  described  (Kirschmann  et  al.,  1999). 

2.4.  Bisulfite  genomic  sequencing 

Genomic  DNA  was  isolated  and  analyzed  by  bisulfite 
genomic  sequencing  as  previously  described  (Kladde  et  al., 
1996).  PCR  products,  amplified  from  bisulfite-deaminated 
DNA  using  JumpStart  Taq  DNA  polymerase  (Sigma),  were 
purified  and  subjected  to  primer  extension  as  described 
(Kladde  et  ah,  1996),  except  that  the  final  concentrations 
of  dNTPs  (A,  C,  T)  and  ddGTP  were  50  and  150  p,M, 
respectively.  Exclusion  of  dGTP  from  the  PCR  product 
primer  extension  reactions  generated  high  termination  effi¬ 
ciencies  (>96%)  (Kladde  et  ah,  1996)  at  template  cytidines 
(nucleotides  that  were  5meC  residues  in  vivo). 

2.5.  Plasmid  constructs 

Fragments  of  the  HP1Hs *  promoter  region  (positions 
-600,  -466,  -418,  -286,  -  166,  and  -110  bp 
relative  to  transcription  start  at  +1)  fused  to  exon  one 
(+143)  were  cloned  into  the  pGL3-Basic  vector  (Promega). 
Forward  primers  used  for  generating  the  deletion  constructs 
were  as  follows:  -  600  bp  primer  5'-GCAGAAAGGAGC- 
GAGCTCACGAACGTATC-3',  -466  bp  primer  5'- 
CCTGCTATTGAGCTCTGGTGCCAC  ATTGC-3'  ,-418 
bp  primer  5'-GGTCGTTCTACGAGCTCTCCACC-3', 
-286  bp  primer  5'  -CTTCCACGAGCTCATATTAC AGT- 
CAAG-3',  -166  bp  primer  5'-GTAAAATGGCGA- 
GCTCTGCGCA-3' ,  -110  bp  primer  5'  -CGTGAAATG- 
GAGCTCAGGAGTAGG-3' .  The  reverse  primer  used  for 
all  of  the  HPlHsy  promoter  deletion  constructs  recognizes 
HP1Hsx  exon  one  and  the  pGL3-Basic  vector:  5'- 
AGATCTCGAGCCCGGGATTGAGAGTGATCA-3' .  The 
mutant  transcription  factor  binding  sites  were  generated 
using  the  QuikChange  site-directed  mutagenesis  kit  (Stra- 
tagene)  and  the  following  primers:  8EF1  (mSEFl): 
5'-CGCATTAAGAAGTTCCCCCCCCCTCTGAGAA- 
CACG-3'  and  5'-CGTGTTCTCAGAGGGGGGGG- 
GAACTTCTTAATGCG-3',  c-myb  (mmyb):  5'- 
CGTTTTGGCGGGCCCCCCCCCTTGCGCAGAAGG-3' 
and  5'-CCTTCTGCGCAAGGGGGGGGGCCCGCC- 
AAAACG-3',  USF’c-myc  site  at  position  —  109  (mUSFp): 
5'-CCTCTGAGACCCCCCCCAATGGCGGGCAGGAG- 
TAG-3'  and  5'-CTACTCCTGCCCGCCATTGGGG- 
GGGGTCTCAGAGG-3',  USF/c-myc  site  at  position 
-172  (mUSFd):  5'-CTCTTGTTGACCGGGGG- 
GGGAGTAAAATGGCG-3'  and  5'-CGCCATTT- 
TACTCCCCCCCCGGTC AAC AAGAG-3' .  The  double 
mutants  of  mmyb  and  mUSFp  (mmybUSFp)  and  mUSFp 
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and  mUSFd  (mUSFpUSFd)  were  also  made  using  the 
above  primers. 

2.6.  Transient  transfection  assays 

MCF-7  and  MDA-MB-231  cells  were  grown  to  80% 
confluency.  A  total  of  1  pg  of  DNA,  including  0.5  pg  of 
promoter  construct  and  0.5  pg  of  CMV-lacZ  (kind  gift  of 
Dr.  Andrew  Russo),  was  transfected  into  the  cell  lines 
using  Effectene  Transfection  Reagent  (Qiagen).  The  cells 
were  grown  for  48  h,  collected  with  Cell  Culture  Lysis 
Reagent  (Promega),  and  assayed  for  luciferase  and  p>- 
galatosidase  expression.  Luciferase  expression  was  mea¬ 
sured  using  Luciferase  Assay  Substrate  (Promega)  to 
monitor  expression  from  HP1Hsx  promoter  constmcts.  p>- 
Galactosidase  expression  was  measured  using  Galacto- 
Light  Plus  System  (Applied  Biosystems)  to  normalize 
for  transfection  efficiency.  Light  units  were  measured  on 
a  96-well  plate  luminometer  (Dynex).  Normalized  lucifer¬ 
ase  light  unit  measurements  were  set  relative  to  light  unit 
measurements  obtained  for  a  promoter  construct  contain¬ 
ing  4  kb  of  HP1Hsx  upstream  sequences,  including  exon 
one.  This  4-kb  construct  gives  uniform  low  levels  of 
expression  in  both  cell  lines.  These  data  were  analyzed 
using  the  Microsoft  Excel  two  samples  unequal  variance 
Student’s  /-test. 

2.7.  P-element  construct  and  Drosophila  germ  line 
transformation 

HP1Hsx  was  fused  in  frame  with  EGFP  and  inserted 
into  the  P-element  vector  pCaSpeRhs-act  (http://thummel. 
genetics.utah.edu/)  containing  an  hsp70  promoter  to  drive 
expression  of  the  fusion  gene  and  a  mini -white*  gene  for 
selection  of  transformants.  To  generate  an  untagged 
Hpih m  construct,  HPlHsct  cDNA  was  inserted  into  the 
P-element  vector  pCaSpeR-hs-act.  Both  resulting  P-ele¬ 
ment  constructs  were  independently  injected  into  y,  w67c23 
Drosophila  embryos,  along  with  P-turbo  helper  plasmid 
encoding  transposase,  according  to  standard  germ-line 
transformation  procedures.  Daily  heat-shock  treatments 
lead  to  an  estimated  three-fold  higher  expression  of 
HP1  !Is“-EGFP  than  the  endogenous  HP1  protein  as  judged 
by  western  analysis  (data  not  shown). 

2.8.  Drosophila  genetics 

All  Drosophila  stocks  were  raised  on  standard  com 
meal  sucrose  media  (Shaffer  et  al.,  1994)  at  25  °C.  Females 
with  the  genotype  P[w+,  hsp70-HPlHsx];  Su(var)2-504/ 
CyO,  GFP  were  crossed  to  males  of  the  genotype 
Su(var)2-502/CyO,  GFP.  Crosses  were  heat  shocked  at 
37 °C  for  45  min  daily.  Rescue  of  lethality  was  indicated 
by  the  presence  of  straight  winged  adults,  representing 
the  genotype  P[w+,  hsp70-HPlHsx] ;  Su(var)2-504/ 
Su(var)2-502 . 


2.9.  Immunostaining  of  polytene  chromosomes 

Third  instar  larvae  were  heat  shocked  at  37 °C  for  1  h  and 
allowed  to  recover  at  room  temperature  for  2  h.  Salivary 
glands  were  dissected,  fixed,  squashed  and  stained  with  a 
monoclonal  antibody  to  HP1  (C1A9)  and  a  polyclonal 
antibody  to  GFP  (Molecular  Probes)  according  to  published 
procedures  (Platero  et  al.,  1995). 

2.10.  Northern  analysis 

RNA  for  Northern  analysis  was  isolated  from  third  instar 
larvae  after  heat  shock  at  37°C  for  1  h  according  to 
published  procedures  (Wallrath  et  al.,  1990).  Levels  of 
mRNA  produced  by  the  heterochromatic  transgenes  were 
measured  by  hybridization  with  barley  cDNA  sequences 
fused  to  the  hsp26  transgene  and  labeled  with  32P-dATP 
(Amersham)  using  random  prime  labeling  (Amersham).  An 
rp49  cDNA  was  used  as  a  control  for  RNA  loading. 


3.  Results 

3.1.  Structure  of  the  HP1Hs“  genomic  region 

HP1Hs*  is  down-regulated  in  highly  invasive/metastatic 
breast  cancer  cells  in  comparison  to  poorly  invasive/non- 
metatstatic  breast  cancer  cells  (Kirschmann  et  al.,  2000).  To 
better  understand  the  mechanism  of  HP1Hsx  down-regula¬ 
tion,  we  have  determined  the  structure  of  the  HP1Hsx 
genomic  region,  including  the  promoter  region  (Fig.  1). 
Clone  F2-10  contains  exons  two  and  three  surrounded  by 
repetitive  sequences  typically  found  in  introns.  This  clone 
spans  a  region  approximately  9  kb  upstream  of  exon  two  to 
300  bp  downstream  of  exon  three.  Exon  two  contains  the 
methionine  translation  start  codon.  Clone  F2-11  contains 
exons  three,  four,  and  five.  A  second  screen,  using  sequen¬ 
ces  corresponding  to  exon  one  and  150  bp  upstream  as  a 
probe,  identified  four  clones  containing  the  HP1Hsx  promot¬ 
er  region.  Clone  3-4  contains  11  kb  of  the  HP1Hsx  promoter 
region  in  addition  to  exon  one  that  is  5'  untranslated 
sequence.  Taken  together,  HP1Hsx  is  encoded  by  five  exons 
spanning  38  kb. 

5'  RACE  was  performed  using  a  primer  to  HP1Hsx 
(positions  +799  to  +824)  to  identity  the  potential 
transcription  start  site.  Three  products,  having  their  5' 
ends  within  22  bp  of  each  other,  were  identified.  We 
designated  + 1  as  the  5'  end  of  the  longest  5'  RACE 
product,  extending  exon  one  of  HP1Hsx  an  additional  54 
bp  upstream  as  compared  to  the  NCB1  CBX5  cDNA 
sequence  NM  012117  (Fig.  1). 

A  bioinformatics  analysis  was  performed  on  HP1Hsx 
promoter  region  sequences.  Using  Matlnspector  V2.2 
http://transfac.gbf.de/TRANSFAC/)  at  stringent  conditions 
(core  sim  1.0,  matrix  sim  0.95),  sequences  from  —601  to 
+  143  were  analyzed  for  known  transcription  factor  binding 
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Genomic  Structure: 


exon  1  exon  2  exon  3  exon  4  exon  5 
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Fig.  1 .  The  genomic  structure  of  the  human  HP1h gene  with  the  lengths  of  each  exon  and  intron  in  bp  or  kb  are  shown.  Transcription  start  sites  are  assigned 
on  the  basis  of  5'  RACE  products.  The  translation  methionine  start  (ATG)  and  stop  are  shown.  The  genomic  clones  are  represented  by  lines  under  the 
corresponding  genomic  regions.  Three  clones  obtained  using  5'  RACE  start  within  54  bp  upstream  of  the  cDNA  sequences  previously  published  in  Genbank. 
The  clone  containing  the  most  5'  sequence  was  designated  +  I;  additional  clones  start  at  +20  and  +22. 


sites.  Using  these  criteria,  37  binding  sites,  some  over¬ 
lapping  with  each  other,  were  identified  within  this  region, 
but  no  TATA  box  was  identified.  The  lack  of  a  TATA  box  is 
consistent  with  having  multiple  transcription  start  sites 
(Gum  et  al.,  2003). 

The  hnRNPAl  gene  is  divergently  transcribed,  starting  at 
position  -  589  bp  upstream  of  the  HPlHsa  transcription  start 
(Fig.  3 A).  Although  the  promoter  regions  of  hnRNPAl 
and  HPlHsa  are  in  close  proximity,  they  appear  to  be 
independently  regulated  in  breast  cancer  cells.  Unlike  the 
different  levels  of  HP1Hs"  observed  in  the  two  breast 
cancer  cell  lines,  hnRNPAl  levels  are  unchanged  be¬ 
tween  MDA-MB-231  and  MCF-7  cells  (data  not  shown). 
DNase  I  footprinting  of  the  region  between  the  hnRNPAl 
and  HPl"sa  genes  was  performed  using  HeLa  cells 
(Biamonti  et  al.,  1993).  Six  potential  transcription  factor 
binding  sites  found  in  our  bioinformatics  analysis  of  the 
HPlHsa  promoter  region  correspond  to  the  DNase  1 
footprints  previously  identified,  including  two  SP1  sites, 
two  CAAT  boxes,  a  CREB/c-jun  site,  two  USF/c-myc 
sites,  and  a  c-myb  site  (Biamonti  et  al.,  1993)  (Fig.  3A). 
A  8EF1  site  within  the  promoter  region  is  also  of 
interest.  The  human  AREB6  repressor  protein  that  binds 
to  the  8EF1  site  is  up-regulated  in  highly  invasive/ 
metastatic  cell  lines  compared  to  poorly  invasive/non- 
metastatic  cell  lines  (Kirschmann  et  al.,  1999).  Therefore, 
the  8EF1  site  was  analyzed  as  a  candidate  regulatory 
element  (Fig.  3A). 


3.2.  Mechanism  o/HP1Hs"  differential  regulation 

The  well-characterized  breast  cancer  cell  lines  MDA- 
MB-231,  which  is  highly  invasive/metastatic,  and  MCF-7, 
which  is  poorly  invasive/non-metastatic,  were  used  to 
determine  the  mechanism  of  differential  regulation  of 
HP1h'7.  One  possible  explanation  for  differential  expres¬ 
sion  of  HP1Hsx  is  that  a  mutation  in  the  HPlHsy  gene 
within  MDA-MB-231  cells  results  in  reduced  expression. 
The  coding  region,  splice  junctions,  portions  of  the 
introns,  and  150  bp  of  the  promoter  region  of  HPlhkj 
from  MDA-MB-231  and  MCF-7  cells  were  sequenced 
and  compared.  No  differences  between  the  HPlHsa  ge¬ 
nomic  sequences  of  the  two  cell  lines  were  found. 
Therefore,  differential  regulation  of  HPlHsa  is  not  likely 
due  to  mutations  within  the  sequenced  regions  of 
HP1Hs: \ 

As  a  second  possibility  to  explain  the  differential 
expression,  we  investigated  the  DNA  methylation  status 
of  HP1Hsx  in  both  cell  types.  In  many  cases  transcriptional 
regulation  in  cancer  cells  is  under  control  of  DNA  meth¬ 
ylation,  particularly  for  genes  near  CpG  islands  (Dallol  et 
al.,  2003).  A  CpG  island  within  exon  one  of  the  hnRNPAl 
gene  (—482  to  —  899  from  HPlHsa  transcription  start)  was 
identified  using  CpGReport  (http://www.ebi.ac.uk/).  Both 
strands  of  the  HP1Hsx  promoter  region  (bases  -  900  to 
+  168),  including  the  CpG  island,  were  subjected  to 
bisulfite  sequencing  to  determine  DNA  methylation  of 
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Fig.  2.  Absence  of  detectable  5-methylcytosine  (5m<:C)  within  the  HPIh 
promoter  region  in  MCF-7  and  MDA-MB-231  cells.  DNA  from  both  cell 
lines  (lanes  1  and  2)  was  subjected  to  a  sensitive  variation  of  genomic 
bisulfite  sequencing  that  is  able  to  detect  low  levels  of  5mcC.  Analysis  of 
sequences  from  -220  to  +168  of  the  HP1Hs*  promoter  is  shown. 
Plasmid  DNA  containing  the  HPIh ’**  promoter  methylated  in  vitro  by  M. 
SssI  (lane  4)  provides  a  marker  for  modified  CpG  sites  (filled  circles). 
Since  the  plasmid  was  isolated  from  a  dcm+  strain  of  E.  coli,  methylation 
at  a  dcm  site  (arrow  head)  was  also  detected  (lanes  3  and  4).  Methylation 
at  the  dcm-  and  M.  Siif-modified  sites  demonstrates  the  signal  intensity 
that  is  commensurate  with  high  and  moderate  levels  of  DNA  methylation, 
respectively.  Thus,  MCF-7  and  MDA-MB-231  cells  do  not  have 
detectable  levels  of  5mcC.  Reactions  carried  out  on  purified  plasmid 
DNA  are  labeled  D. 


HP1Hs*  in  the  MDA-MB-231  and  MCF-7  cell  lines  (Fig.  2 
shows  the  methylation  status  of  bases  —220  to  +168; 
bases  —900  to  —220  are  not  shown).  Limited  DNA 
methylation,  if  any,  was  observed  in  either  cell  type,  and 
the  methylation  status  was  unchanged  between  the  two  cell 
lines  throughout  the  HP1Hs7  promoter  region  and  exon 
one,  including  the  CpG  island  within  htiRNPAl.  Thus, 
methylation  does  not  appear  to  be  involved  in  the  differ¬ 
ential  regulation  of  HPlHm. 

As  a  third  explanation  for  the  differential  regulation,  we 
hypothesized  that  differential  expression  might  arise 
through  different  interactions  between  transcription  factors 
and  c/.s-acting  DNA  elements  of  the  HPlHsy  promoter  in 
the  two  cell  types.  Fragments  of  the  HP1Hsj  promoter, 
including  untranslated  exon  one,  were  cloned  upstream  of 
a  lucif erase  reporter  gene.  These  constructs  were  co-trans¬ 


fected  into  MDA-MB-231  and  MCF-7  cells  along  with  a 
plasmid  containing  the  CMV  promoter  driving  a  lacZ 
reporter  gene.  Expression  of  luciferase  and  (3-galactosidase 
was  measured  in  light  units.  Luciferase  expression  was 
normalized  to  p-galactosidase  expression  to  account  for 
differences  in  transfection  efficiency.  Levels  of  luciferase 
above  background  were  observed  from  all  constructs, 
indicating  that  the  cloned  promoter  region  possessed 
transcriptional  activity.  A  5.6-fold  difference  in  luciferase 
activity  was  observed  between  the  MDA-MB-231  and 
MCF-7  cell  lines  for  the  largest  construct,  —  600/+143, 
containing  13  sequences  between  the  HP1Hs“  and  the 
hnRNPAl  predicted  transcription  start  sites  (p-value  = 
0.0021)  (Fig.  3B  and  C).  This  difference  in  expression 
is  similar  to  that  observed  for  the  endogenous  HP1Hsx 
gene  between  the  two  cell  types  (Kirschmann  et  al.,  2000). 
Deletions  that  removed  successive  amounts  of  5'  sequen¬ 
ces  showed  a  reduction  in  the  fold  change  in  expression 
between  the  two  cell  types,  suggesting  the  loss  of  regu¬ 
latory  sequences  (Fig.  3B).  In  particular,  the  differences  in 
expression  between  the  —  166/ +143  construct  in  the  two 
cell  types  were  barely  statistically  significant  (^-val¬ 
ue  =0.0608).  One  caveat  of  these  deletion  constructs  is 
that  vector  DNA  sequences  are  brought  into  close  prox¬ 
imity  to  the  HPlHm  promoter,  possibly  contributing  to 
regulation. 

To  better  identify  elements  involved  in  differential  reg¬ 
ulation  of  HP1Hs*  between  the  two  cell  types,  constructs 
containing  mutations  within  candidate  transcription  factor 
binding  sites  were  analyzed.  These  constructs  allowed  for 
the  retention  of  600  bp  of  upstream  sequences.  Of  particular 
interest  was  a  6EF1  binding  site  at  position  —  125  (Fig.  3A). 
This  site  can  be  bound  by  the  human  homolog  of  the 
chicken  5EF1  protein,  AREB6,  a  transcriptional  repressor 
(Ikeda  et  al.,  1998)  that  is  up-regulated  in  highly  invasive/ 
metastatic  cell  lines,  including  MDA-MB-23 1 ,  compared  to 
poorly  invasive/non-metastatic  cell  lines,  such  as  MCF-7 
(Kirschmann  et  al.,  1999).  Thus,  the  6EF1  binding  site  was  a 
promising  candidate  for  regulating  differential  expression  of 
HPIHsj.  A  mutation  in  the  conserved  6EF1  site  (mSEFl) 
was  constructed  by  replacing  the  4  bp  core  binding  site  and 
5  bp  of  surrounding  sequence  with  nine  cytosines  in  the 
context  of  the  —600/ +143  construct.  Differential  expres¬ 
sion  between  the  two  cell  lines  was  still  observed  (2.87-fold 
difference,  p-value  =  0.0004)  (Fig.  3C).  Therefore,  the  8EF 1 
site  does  not  appear  to  be  involved  in  differential  regulation 
of  HPlHm. 

Several  additional  candidate  transcription  factor-binding 
sites  were  also  investigated  for  their  effects  on  differential 
regulation.  These  include  a  c-myb  site  within  exon  one  of 
HpiHs “  (Fig.  3A).  Mutation  of  this  site  (mmyb),  replacing 
the  core  binding  region  and  surrounding  bases  with  nine 
cytosines,  retained  differential  expression  (nine-fold  dif¬ 
ference,  />va1ue  =  0.0124)  (Fig.  3C).  Furthermore,  muta¬ 
tions  of  two  USF/c-myc  sites  located  at  positions  -  109 
and  —  172,  designated  USFd  (distal)  and  USFp  (proxi- 
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Fig.  3.  (A)  Diagram  of  the  promoter  region  between  the  divergently  transcribed  hnRNPAl  and  HP1Hs*  genes.  Consensus  transcription  factor  binding  sites  that 
were  previously  shown  to  be  footprinted  by  protein  in  HeLa  cells  (Biamonti  et  al.,  1993)  are  indicated  by  black  lines.  Transcription  factor  binding  sites 
identified  by  our  bioinformatics  searches  are  indicated  by  the  name  above  the  boxes.  (B)  Results  of  luciferase  assays  from  5'  deletion  constructs  containing 
HP1Hs*  promoter  fragments  fused  to  a  luciferase  reporter  gene.  The  construct  names  indicate  the  5'  and  3'  sequence  boundaries.  The  constructs  were 
transfected  into  a  metastatic/highly  invasive  cell  line,  MDA-MB-23 1  and  a  nonmetastatic/poorly  invasive  cell  line,  MCF-7.  The  number  of  samples,  fold 
change  between  expression  in  MCF-7  and  MDA-MB-23 1  cells,  and  p-  values  are  indicated  for  each  construct.  (C)  Results  of  mutational  analysis  of  the  HP1h'x 
promoter  constructs.  Construct  names  reflect  the  site(s)  mutated.  Asterisks  mark  /7-values  that  show  no  statistical  difference  in  expression  between  the  two  cell 
lines,  indicating  a  loss  of  differential  regulation. 


trial),  respectively,  were  tested  for  effects  on  expression 
(Fig.  3A).  Mutations  in  the  individual  sites  (mUSFp  and 
mUSFd),  double  mutations  of  both  USF/c-myc  sites 
(mUSFpUSFd),  and  mutation  of  USFp  in  combination 
with  mutation  of  the  c-myb  site  (mmybUSFp)  were 
examined.  All  of  the  mutations  replaced  the  core  4  bp 
of  the  transcription  factor  binding  sites  and  surrounding 
sequence  with  eight  cytosines.  The  mUSFp  mutation 
retained  differential  expression  (5.38-fold  difference,  p- 
value  =  0.0137)  (Fig.  3C).  The  mmybUSFp  double  mutant 
also  retained  differential  expression  (7.12-fold  difference, 
p-value  =  0.0004).  In  contrast,  mUSFd  eliminated  differ¬ 
ential  expression  (0.99-fold  difference,  p- value  =  0.9882). 
Consistent  with  this  finding,  mUSFpUSFd  disrupted  dif¬ 
ferential  expression  (p-value  =  0.1756).  Thus,  the  USFd 
site  at  position  —  172  is  required  for  differential  expres¬ 
sion  of  HP1Hsu  between  MCF-7  and  MDA-MB-23 1  cell 
lines. 


3.3.  Human  HPlHsct  functions  similar  to  Drosophila  HP l 

In  addition  to  understanding  HPlHm  regulation,  we  also 
aimed  to  understand  the  functional  properties  of  this  protein. 
HP1  proteins  are  highly  conserved  between  species,  sug¬ 
gesting  related  functions.  HP1  was  first  identified  in  Dro¬ 
sophila  where  functional  studies  have  been  performed, 
demonstrating  a  role  in  gene  silencing  (Weiler  and  Waki- 
moto,  1995).  Three  human  HP1  family  members,  HP1Hs“, 
HP1Hs(J  and  HP1 Hs  /,  show  a  high  degree  of  amino  acid 
sequence  identity  with  Drosophila  HP1;  however,  it  is 
unclear  which  family  member  is  the  functional  homolog 
of  Drosophila  HP1.  HP1Hs7  shows  44%  overall  amino  acid 
sequence  identity  with  Drosophila  HP1,  60%  in  the  CD,  and 
38%  in  the  CSD.  These  values  are  similar  to  those  of 
HPlHs"  showing  44%  overall,  65%  in  the  CD,  and  43% 
in  the  CSD.  HPlHsp  has  slightly  greater  overall  amino  acid 
identity  to  Drosophila  HP1  than  HPlHs“  (46%  vs.  44%). 
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The  HPlHsp  CD  shows  slightly  more  identity  with  the 
Drosophila  HP1  CD  than  the  HP1Hs7  and  HP1Hs“  CDs 
(68%  vs.  60%  and  65%,  respectively).  In  contrast,  the 
HpjHsp  shows  less  identity  to  the  Drosophila  HP1 
CSD  than  the  HP1Hs“  CSD  (39%  vs.  43%).  In  sum, 
comparisons  of  the  amino  acid  sequences  of  human  and 
Drosophila  HP1  identified  only  minor  differences  in  the 
percent  identity  without  immediately  suggesting  a  function¬ 
al  homologue. 

Another  protein  feature  that  might  suggest  similar  func¬ 
tion  between  Drosophila  HP1  and  a  human  HP1  is  the 
chromosome  localization  pattern.  Drosophila  HP1  shows 
enrichment  at  heterochromatic  regions  and  localizes  to 
approximately  200  euchromatic  sites  on  larval  polytene 
chromosomes  (Fanti  et  al.,  2003).  HP1Hs“  and  HP1Hs(4 
predominantly  localize  to  centric  heterochromatin,  showing 
partial  overlap  with  anti-centromere  antibodies  (Mine  et  al., 
1999).  In  contrast,  HP1Hs7  localizes  to  centric  heterochro¬ 
matin  and  euchromatic  regions  (Mine  et  al.,  2000).  Based  on 
this  localization  data,  HP1Hs7  appears  to  have  a  more  similar 
pattern  to  that  of  Drosophila  HP1. 

To  investigate  the  functional  properties  of  the  HP1Hs“ 
protein  and  determine  whether  it  is  a  functional  homologue 
of  Drosphila  HP1,  we  generated  transgenic  Drosophila  that 
expressed  an  HPlHsa-EGFP  fusion  gene  under  the  control 
of  an  hsp70  heat  shock  promoter.  Homozygous  HP1Hs 
EGFP  larvae  were  heat  shocked  1  h  at  37 °C  and  allowed 


to  recover  for  2  h  at  room  temperature.  Salivary  glands 
were  dissected  from  the  larvae,  squashed  and  stained  with 
antibodies  that  recognize  EGFP  and  Drosophila  HP1.  The 
results  indicated  that  HP1Hs“-EGFP  localized  to  the  chro¬ 
mocenter  (the  site  of  fusion  of  all  the  centromeres),  the 
heterochromatic  fourth  chromosome,  and  euchromatic  sites 
in  a  pattern  that  completely  overlapped  with  endogenous 
Drosophila  HP1  (Fig.  4A).  It  was  possible  that  co-locali¬ 
zation  was  due  to  interactions  between  the  HP1Hs“  CSD 
and  the  Drosophila  HP1  CSD,  since  CSDs  have  been 
shown  to  dimerize  (Cowieson  et  al.,  2000).  Therefore, 
we  assayed  the  localization  of  HP1Hsoi-EGFP  in  larvae 
lacking  endogenous  Drosophila  HP1.  HP1Hs“-EGFP 
showed  the  same  pattern  of  localization  on  larval  polytenes 
with  or  without  endogenous  HP1,  indicating  that  HP1Hs“ 
associates  with  chromosomes  by  similar  mechanisms  as 
Drosophila  HP1  (Fig.  4B). 

Overexpression  of  Drosophila  HP1  enhances  silencing  of 
genes  repressed  by  heterochromatin  (Weiler  and  Wakimoto, 
1995).  To  test  whether  HP1Hs“-EGFP  has  a  similar  function, 
HP1Hs“-EGFP  was  overexpressed  (two-fold  over  endoge¬ 
nous  HPl)  in  stocks  carrying  a  tagged  hsp26  heat  shock 
gene  inserted  at  different  heterochromatic  locations.  Expres¬ 
sion  of  HP1Hs“-EGFP  by  daily  heat  shock  resulted  in  a  40% 
reduction  in  hsp26  expression  from  a  centric  transgene  (Fig. 
5).  Similarly,  expression  of  HP1Hs“-EGFP  resulted  in  a  50% 
reduction  in  hsp26  expression  from  a  telomeric  transgene 
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Fig.  4.  (A)  Polytene  chromosomes  from  third  instar  larvae  salivary  glands  from  a  stock  carrying  an  HPIHsj-EGFP  transgene.  The  chromosomes  were  fixed, 
squashed  and  stained  with  antibodies  against  Drosophila  HPl  (DmHPl;  red)  and  GFP  (green).  The  DmHPl  antibody  does  not  recognize  H P 1 1 by  Western 
blot  analysis  (data  not  shown).  Co-localization  is  observed  in  yellow.  (B)  Polytene  chromosomes  from  HPl  mutants  containing  the  HP1Hsj-EGFP  transgene 
stained  with  DmHPl  (red)  and  GFP  (green).  Arrowheads  denote  the  chromocenter  (formed  by  the  fusion  of  all  centromeres). 
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Fig.  5.  Effects  of  HPllls“-EGFP  expression  on  gene  silencing.  Flies 
expressing  the  HP1h'*-EGFP  transgene  were  crossed  to  flies  that  contain  a 
reporter  hsp26  transgene  inserted  at  a  heterochromatic  or  telomeric  location. 
RNA  was  isolated  from  the  heat-shocked  progeny  and  analyzed  by  northern 
analysis  with  radiolabeled  sequences  corresponding  to  the  hsp26  reporter 
gene  and  the  rp49  loading  control. 

(Fig.  5).  These  data  demonstrate  that  the  human  HP1Hs“ 
protein  participates  in  gene  silencing  and  has  similar  func¬ 
tions  as  Drosophila  HP1. 

Several  homozygous  lethal  mutations  exist  in  Su(var)2- 
5,  the  Drosophila  gene  encoding  HP1  (Eissenberg  and 
Hartnett,  1993).  These  mutations  were  used  to  determine 
whether  HPlHs“-EGFP  could  rescue  lethality.  Flies  carry¬ 
ing  the  HPl',s/-EGFP  transgene  that  were  heterozygous 
for  a  mutant  allele  of  the  gene  encoding  HP1  [Su(var)2- 
504]  balanced  over  a  chromosome  possessing  a  Curly  wing 
mutation  were  crossed  to  flies  heterozygous  for  a  second 
mutant  allele  of  the  gene  encoding  HP1  [Su(var)2-5n2] 
over  the  Curly  balancer  chromosome.  From  this  cross, 
Curly  wing  homozygotes,  25%  of  total  progeny,  die  as 
early  embryos.  Flies  heterozygous  for  the  Curly  wing 
balancer  chromosome  and  a  Su(var)2-5  allele,  50%  of 
the  total  progeny,  were  viable.  The  final  class  of  progeny, 
heteroallelic  for  the  Su(var)2-5  mutant  alleles,  constituting 
25%  of  the  total  progeny,  would  be  lethal  if  no  rescue  is 
observed.  Rescue  of  such  individuals  would  give  rise  to 
straight  wing  adults.  When  HP1h™-EGFP  was  expressed 
by  daily  heat  shock  treatment,  2%  (4/218  adults)  of  the 
total  progeny  had  straight  wings,  suggesting  partial  func¬ 
tion  of  the  HP1Hs“-EGFP  fusion  protein.  More  convincing, 
14%  (35/244  adults)  of  total  progeny  were  rescued  to 
adulthood  by  expressing  HPlHsa  without  the  EGFP  tag. 
These  data  suggest  that  despite  exhibiting  the  correct 
pattern  of  localization  and  gene  silencing  effects,  EGFP 
was  limiting  the  function  of  HPlHs“.  The  ability  of  the 
untagged  version  to  show  appreciable  rescue  (14%  vs. 
25%  for  complete  rescue)  strongly  suggests  that  HP  1  Hs“  is 
a  functional  homolog  of  the  Drosophila  HP1  protein. 


4.  Discussion 

4.1.  Conservation  of  gene  structure 

The  HP1  genomic  structure  is  conserved  from  Drosoph¬ 
ila  to  humans.  Drosophila  Su(var)2-5,  mouse  mHPla,  M31, 
and  M32,  and  human  HP1Hsx,  HP1HsI>,  and  HPlHsy  are  each 
comprised  of  five  exons  and  four  introns.  Translation  start 
begins  in  exon  two  in  Drosophila  Su(var)2-5,  mouse 
mHPla.  and  M31,  and  human  HP1Hsx,  HP1Hs &  and 
HPlHsy  In  contrast,  the  translation  start  of  M32  is  within 
exon  three.  Due  to  the  insertion  of  an  intron  within  exon 
one,  M32  exon  three  corresponds  to  that  of  exon  two  in  the 
other  HP1  genes  (Jones  et  al.,  2001).  The  CD  of  all  of  the 
mammalian  HP1  genes,  except  for  M3 2,  is  contained  in 
exons  two  and  three  (Jones  et  al.,  2001).  Exons  three  and 
four  of  M32  have  fused  to  become  exon  four.  Therefore,  the 
CD  of  M32  is  found  within  exons  three  and  four.  The  CSD 
of  all  of  the  mammalian  HP1  genes  is  found  in  exons  four 
and  five  (Jones  et  al.,  2001).  The  amino  acids  at  the  splice 
junctions  are  conserved  in  all  the  mammalian  HP1  family 
members,  except  M32,  but  are  distinctly  different  for  Dro¬ 
sophila  Su(var)2-5.  Therefore,  the  genomic  structure,  but 
not  the  intron/exon  boundaries,  are  conserved  from  Dro¬ 
sophila  Su(var)2-5  to  human  HP1  family  members. 

The  Su(var)2-5  gene,  at  cytological  position  28F1-2, 
encodes  a  protein  that  is  commonly  referred  to  as  Drosoph¬ 
ila  HP1  and  sometimes  referred  to  as  HP  la  (Smothers  and 
Henikoff,  2001).  There  are  two  additional  HPl-like  genes 
located  at  cytological  positions  87C  and  94C4,  called  HPlb 
and  HPlc,  respectively.  The  proteins  encoded  by  these 
genes  do  not  exhibit  a  chromosomal  distribution  pattern 
that  significantly  overlaps  with  HP1  (Smothers  and  Henik¬ 
off,  2001).  In  addition,  these  two  proteins  have  limited 
amino  acid  sequence  identity  with  HP1;  HPlb  is  44% 
identical  and  HPlc  is  31%  identical  to  HP1.  Furthermore, 
HPlb  and  HPlc  do  not  have  a  conserved  genomic  structure 
http://www.flybase.org/)  with  the  mammalian  members  of 
the  HP1  family,  as  does  Drosophila  HP1.  Thus,  HP1  was 
exclusively  used  for  comparisons  in  this  study. 

The  three  human  genes  encoding  HPlHsa,  HP1HsI>  and 
HPlHsy  located  at  12ql3,  17q21  and  7ql5,  respectively, 
have  several  corresponding  pseudogenes  present  in  the 
human  genome.  In  our  screen  of  the  human  genomic  library, 
we  recovered  one  processed  HPl,,sa  pseudogene  (data  not 
shown)  corresponding  to  sequences  on  chromosome  3p  11.1 
(http://www.ncbi.nlm.nih.gov/).  Five  pseudogenes  have 
been  previously  published  for  HPlm,  and  four  for  HPlHsy 
(Jones  et  al.,  2001).  The  pseudogenes  of  HPlHsli,  containing 
few,  if  any,  introns  are  found  on  chromosomes  lq32,  3q26, 
14q24,  Xp22,  and  Xqll  (http://www.ncbi.nlm.nih.gov/) 
suggesting  they  are  processed  pseudogenes.  Eleven 
HP1Hs~‘  pseudogenes  are  found  on  NCBI  Genbank,  four 
of  which  correspond  to  the  previously  published  pseudo¬ 
genes  (http://www.ncbi.nlm.nih.gov/;  Jones  et  al.,  2001). 
The  pseudogenes  of  HPlHsy,  containing  few,  if  any,  introns 
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are  found  on  chromosomes  2q24,  3p22,  5q22,  6q22.2, 
llpll,  llpl4,  1  lql4,  12pl3,  12q23,  16pl3,  and  18pll 
(http://www.ncbi.nlm.nih.gov/).  Thus,  each  of  the  functional 
human  HP1  family  members  is  encoded  by  separate  un¬ 
linked  genes  that  have  multiple  pseudogenes  scattered 
throughout  the  genome. 

4.2.  HPlHsy  regulation  in  breast  cancer  metastasis 

HPlHsa  is  down-regulated  in  highly  invasive/metastatic 
breast  cancer  cells  compared  to  poorly  invasive/non-meta¬ 
static  cells  (Kirschmann  et  al.,  2000).  This  regulation  likely 
occurs,  at  least  in  part,  at  the  transcriptional  level  and  does 
not  involve  differential  DNA  methylation.  An  analysis  of 
the  DNA  sequences  in  the  HP1Hsx  promoter  region  identi¬ 
fied  potential  binding  sites  for  transcriptional  regulators  that 
might  be  involved  in  differential  regulation.  Only  three  of 
the  binding  sites  shown  in  Fig.  3,  the  two  SP1  sites 
immediately  upstream  of  hnRNPAl,  the  CAAT  box  (posi¬ 
tion  —  244)  and  the  proximal  USF/c-myc  site  (position 
— 109),  are  conserved  between  mouse  and  humans;  none 
of  the  elements  can  be  identified  upstream  of  the  gene 
encoding  Drosophila  HP1. 

Mutation  of  a  8EF1  binding  site  (at  position  -  125), 
which  associates  with  the  AREB6  protein  found  to  be  up- 
regulated  in  highly  invasive/metastatic  breast  cancer  cells 
(Kirschmann  et  al.,  1999),  does  not  appear  to  be  involved  in 
the  differential  expression  of  HPlHsa.  Mutation  of  a  c-myb 
binding  site  at  position  +97  and  a  USF/c-myc  site  at 
position  -  109  does  not  appear  to  play  a  role  in  differential 
regulation  either.  In  contrast,  mutation  of  a  USF/c-myc  site 
at  position  —  172  abolishes  differential  regulation.  This 
USF/c-myc  was  protected  from  DNase  I  digestion,  suggest¬ 
ing  occupancy  by  protein(s)  in  HeLa  cells  (Biamonti  et  al., 
1993).  USF/c-myc  sites,  commonly  called  E-boxes,  are 
frequently  bound  by  a  variety  of  proteins,  including  USF 
and  Myc  family  members.  USF  proteins  are  involved  in 
both  gene  silencing  and  activation,  sometimes  at  the  same 
site  under  different  circumstances  (Goueli  and  Janknecht, 
2003).  Myc  proteins  are  also  involved  in  both  gene  activa¬ 
tion  and  repression,  depending  on  their  dimerization  partner 
(Queva  et  al.,  1998).  Therefore,  the  function  of  the  distal 
USF/c-myc  site  in  the  differential  regulation  of  HP1Hs*  is 
difficult  to  predict  and  warrants  further  investigation. 

4.3.  Function  ofHPlHm 

Our  data  strongly  suggest  that  HPlHsa  is  a  functional 
homolog  of  the  Drosophila  HP1  protein.  The  results 
showing  that  FIPlIls“  can  localize  to  the  same  sites  on 
polytene  chromosomes  as  Drosophila  HP1  are  consistent 
with  previously  published  results  (Ma  et  al.,  2001).  We 
extend  these  findings  by  demonstrating  that  HPlHsa 
exhibits  the  Drosophila  HP1  pattern  of  localization  even 
in  the  absence  of  Drosophila  HP1.  These  results  suggest  a 
conserved  mechanism  for  localization.  Previously  pub¬ 


lished  results  show  that  HP1Hs“  can  enhance  silencing 
induced  by  a  transgene  array  in  Drosophila  (Ma  et  al., 
2001).  These  arrays  have  similar,  yet  distinctly  different, 
properties  than  heterochromatin  (Prasad-Sinha  et  al., 
2000).  Our  results  clearly  demonstrate  that  HP1Hs“  can 
participate  in  heterochromatin  formation  and  silence  eu- 
chromatic  genes  placed  within  heterochromatin.  Thus, 
HpjHsc*  jias  ggne  sjiencjng  functions  similar  to  Drosophila 
HP1. 

Species  specificity  of  protein  function  can  be  addressed 
by  determining  whether  a  protein  from  one  organism  can 
provide  complete  function  of  the  homologous  protein  in 
another  organism.  HP1Hs“  can  rescue  the  lethality  of  a 
Drosophila  HP1  homozygous  mutant;  therefore,  FIP1Hs“  is 
a  functional  homolog  of  Drosophila  HP1.  In  contrast  to 
our  findings,  the  mouse  M31  protein  was  unable  to  rescue 
mutant  phenotypes  associated  with  S.  pombe  Swi6  mutants 
(Wang  et  al.,  2000).  Rescue  was  obtained,  however,  when 
the  Swi6  CSD  was  substituted  for  the  M3 1  CSD  (Wang  et 
al.,  2000).  The  overall  amino  acid  sequence  identity 
between  S.  pombe  Swi6  and  mouse  M31  is  37%.  This  is 
much  less  than  the  44%  overall  amino  acid  sequence 
identity  between  Drosophila  HP1  and  human  HPlHsa.  In 
particular,  the  amino  acid  sequence  identity  between  the  S. 
pombe  Swi6  CSD  and  the  mouse  M31  CSD  is  39%, 
whereas  the  amino  acid  sequence  identity  between  Dro¬ 
sophila  HP1  CSD  and  the  human  HP1Hs“  CSD  is  43%. 
Therefore,  the  CSD  of  Drosophila  HP1  and  human  HP1Hs“ 
is  more  conserved  than  the  CSD  of  S.  pombe  Swi6  and 
mouse  M3 1.  The  amino  acid  sequence  differences  between 
mouse  and  S.  pombe  might  explain  the  species-specificity 
observed. 

4.4.  Model  for  HP lHsu  function  in  breast  cancer  metastasis 

Given  the  conserved  function  of  HP1Hs“  in  gene  regula¬ 
tion,  one  possible  role  for  HP1Hs“  in  breast  cancer  metas¬ 
tasis  is  gene  silencing.  Accordingly,  the  HPlHsa  gene  would 
be  expressed  in  normal  and  primary  breast  cancer  tumor 
cells  where  it  produces  protein  that  functions  to  silence 
genes  required  for  invasion  and  metastasis.  In  highly  inva¬ 
sive/metastatic  breast  cancer  cells,  HPlHsa  expression  is 
reduced  and  less  HP1Hs“  protein  is  available  to  carry  out 
gene  silencing  functions.  Clearly  in  Drosophila  and  mice 
HP1  affects  gene  expression  in  a  dosage-dependent  manner 
(Weiler  and  Wakimoto,  1995;  Festenstein  et  al.,  1999). 
According  to  this  model,  loss  of  silencing  would  occur  at 
genes  encoding  proteins  that  are  required  for  invasion  and 
metastasis.  Therefore,  the  identification  of  HP1Hsm  regulated 
genes  is  a  goal  for  future  investigation. 
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Abstract 


Proper  regulation  of  gene  expression  is  required  for  normal  growth  and  development. 
Developmental  abnormalities  and  disease  can  result  from  the  misregulation  of  gene 
expression.  While  some  genes  are  clearly  controlled  by  mechanisms  of  activation,  a 
significant  proportion  of  genes  are  controlled  by  mechanisms  of  silencing.  Silencing 
mechanisms  can  account  for  the  inactivation  of  the  approximately  one  thousand  genes  on 
the  human  X-chromosome,  for  uniparental  inactivation  of  chromosomal  domains,  and  for 
inactivation  of  individual  genes.  Common  features  of  silencing  events  include  initiation 
by  RNAs,  either  as  non-coding  RNAs  expressed  from  a  single  gene  that  remain 
associated  near  the  site  of  synthesis  or  as  small  interfering  RNA  molecules  that  act  in 
trans.  Other  common  epigenetic  features  include  specific  histone  modifications,  which 
generate  a  code  for  silent  chromatin,  and  DNA  methylation.  Non-histone  chromosomal 
proteins  have  been  discovered  that  associate  with  specific  histone  modifications.  These 
non-histone  chromosomal  proteins,  “translators”  of  the  histone  code,  include 
Heterochromatin  Protein  1  and  Polycomb,  two  factors  that  play  a  role  in  organizing 
chromatin  structure.  This  review  discusses  discoveries  made  on  gene  silencing  systems 
and  their  connections  with  disease. 


2 


Gene  silencing  phenomena 


Expression  of  genes  within  the  human  genome,  estimated  to  contain  approximately 
25,000  genes,  requires  proper  regulation.  It  would  be  deleterious  for  a  cell  to  express  its 
full  complement  of  genes  at  once.  Therefore,  mechanisms  to  differentially  activate  and 
silence  genes  are  used  to  coordinate  gene  expression  on  the  basis  of  developmental 
timing  and/or  cell-type  specificity.  Mechanisms  of  gene  activation  have  been  well 
studied1,  however,  mechanisms  of  gene  repression,  sometimes  referred  to  as  gene 
silencing,  are  just  beginning  to  be  understood.  This  review  discusses  classical  cases  of 
gene  silencing  as  well  as  modem  molecular  discoveries  that  have  revealed  mechanisms  of 
gene  silencing.  As  anticipated  from  early  studies  of  genome  packaging,  chromatin- 
associated  proteins  play  a  central  role  in  the  process  of  gene  regulation,  where  an 
epigenetic  “code”  for  silencing  has  emerged.  Unanticipated  newcomers  involved  in  gene 
silencing  include  non-coding  and  small  interfering  RNA  molecules.  Taken  together,  the 
molecular  basis  for  gene  silencing  is  beginning  to  be  deciphered,  yet  much  remains  to  be 
understood. 

Position  effect  variegation 

In  the  1920’s  and  1930’s,  classical  genetic  analysis  was  carried  out  by  subjecting 
Drosophila  (fruit  flies)  to  X-rays.  Through  such  early  mutagenesis  experiments,  Muller 
identified  a  special  class  of  mutant  flies  called  “eversporting”  that  possessed  red  and 
white  mosaic  eyes,  with  red  being  the  wild  type  color2.  Later,  cytological  and  molecular 
data  showed  that  Muller’s  eversporting  mutant  phenotype  was  due  to  chromosomal 
translocations  (relocation  of  chromosomal  fragments  to  a  new  location  in  the  genome)  of 
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the  white  gene,  which  encodes  a  protein  required  to  generate  red  eyes3.  Translocation  of 
the  white  gene  near  a  centromere  results  in  silencing  in  a  subset  of  cells  (Figure  1A). 

This  gene  silencing  phenomenon  was  designated  position  effect  variegation  (PEV)3 
because  it  was  caused  by  the  new  position  of  the  gene  in  the  genome,  rather  than  a 
mutation  in  the  sequence  of  the  gene. 

Following  the  discovery  of  PEV,  several  genetic  screens  were  performed  in  Drosophila  to 
identify  dominant  mutations  that  enhance  [Enhancer  of  variegation,  E(var)s ]  or  suppress 
[Suppressors  of  variegation,  Su(  varjs]  PEV  (Figure  1A)4,5.  Some  Su(var)  loci  were 
found  to  be  haplo-suppressors  and  triplo-enhancers  of  PEV5.  Loss  of  one  copy  of  the 
Su(var)  gene  suppresses  variegation  and  addition  of  an  extra  copy  of  the  Su(var)  gene 
enhances  variegation,  indicating  a  dosage  sensitivity  of  the  variegating  gene  for  the 
products  of  these  Su(var)  genes.  Such  findings  suggest  that  critical  levels  of  modifier 
proteins  are  essential  for  proper  gene  expression.  Many  of  the  E(var)  and  Su(var)  genes 
have  been  found  to  encode  chromatin  proteins  and  enzymes  that  modify  other  chromatin 
proteins,  suggesting  a  role  in  gene  regulation. 

PEV  is  not  restricted  to  Drosophila,  as  examples  have  been  observed  in  yeast  and  mice. 

In  yeast,  genes  inserted  near  centromeres,  telomeres,  and  the  silent  mating  type  locus 
exhibit  variegated  expression6'8.  PEV  is  best  demonstrated  by  red  and  white  sectored 
yeast  colonies  that  occur  when  an  ade6  transgene  inserted  near  centric  regions  is  silenced, 
leading  to  red  coloration  within  the  yeast  colony2  (Figure  IB).  Likewise,  in  mice, 
transgenes  inserted  near  centromeres  show  PEV.  This  has  been  demonstrated  for  a  CD2 
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transgene  lacking  a  regulatory  region  that  protects  it  from  genomic  position  effects9, 10. 
The  CD2  transgene  encodes  a  transmembrane  receptor  that  is  expressed  on  a  cell 
autonomous  basis.  When  CD2  is  inserted  near  a  centromere,  PEV  is  observed  as  a 
subpopulation  of  cells  lacking  CD2  receptors  on  their  surface9  (Figure  1C).  Thus, 
genomic  regions  such  as  those  surrounding  centromeres  in  diverse  species  share 
properties  that  give  rise  to  PEV. 

X-chromosome  dosage  compensation 

In  many  species,  males  and  females  have  different  numbers  of  sex  chromosomes.  In 
order  to  compensate  for  differential  expression  of  genes  on  sex  chromosomes,  processes 
of  dosage  compensation  have  evolved".  Dosage  compensation  is  facilitated  by  different 
mechanisms  for  many  of  the  classical  model  organisms  where  X-chromosome  number 
varies  between  males  and  females.  In  insects,  such  as  Drosophila,  genes  on  the  X- 
chromosome  in  males  are  expressed  at  two-fold  greater  levels  than  from  the  female  X". 

In  the  worm  Caenorhabditis  elegans,  genes  on  both  female  X-chromosomes  are  down- 
regulated  to  half  the  expression  level  of  the  single  male  X".  A  different  strategy  is  taken 
in  mammals,  where  genes  on  one  of  the  female  X-chromosomes  are  inactivated,  resulting 
in  expression  from  the  one  active  X-chromosome  equaling  that  observed  from  the  single 
male  X-chromosome11.  In  all  cases,  the  net  result  is  to  balance  expression  of  genes  on  the 
X-chromosome  between  the  sexes,  although  the  means  to  achieve  this  balance  occurs  by 
different  mechanisms.  The  species-specific  strategies  to  achieve  this  balance  have 
something  in  common:  they  all  utilize  aspects  of  chromatin  packaging  to  change  the 
expression  of  X-linked  genes  at  the  level  of  transcription. 
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From  cytological  analyses  of  the  mammalian  inactive  X  (Xi),  also  known  as  the  Barr 
body,  alterations  in  chromatin  packaging  are  clearly  associated  with  gene  silencing12,  l3. 
The  Xi,  which  becomes  inactivated  in  the  embryo  before  implantation  of  the  blastocyst, 
forms  a  compact,  mitotically  stable  structure  localizing  to  the  nuclear  periphery1  '■ 13.  In 
some  mammals,  such  as  marsupials,  the  paternal  X-chromosome  is  always  designated  as 
the  chromosome  to  undergo  Barr  body  formation11.  In  placental  mammals,  such  as  cats 
and  humans,  the  paternal  X-chromosome  is  always  inactivated  in  extraembryonic  tissues, 
but  within  the  embryonic  tissue  the  Xi  is  randomly  chosen  between  the  paternally  and 
maternally  inherited  X-chromosomes11.  Random  inactivation  can  lead  to  the  mosaic 
expression  of  X-linked  alleles  within  an  individual,  hence  the  molecular  basis  of  the 
female  calico  cat14. 

Mosaicism  at  the  individual  level  occurs  when  the  paternal  X-chromosome  is  inactivated 
in  a  fraction  of  the  cells  and  the  maternal  X-chromosome  is  inactivated  in  the  remainder 
of  the  cells,  leading  to  different  cells  within  an  individual  expressing  different  alleles  of 
the  same  gene15.  X-inactivation  can  have  implications  for  human  disease.  For  example, 
Bum  et  al  (1986)  reported  monozygotic  twin  girls,  only  one  of  which  displayed 
phenotypes  associated  with  the  X-linked  disease  Duchenne  Muscular  Dystrophy 
(DMD)16.  In  the  affected  twin,  the  paternally  inherited  X-chromosome  was  inactivated  in 
the  majority  of  cells16.  In  contrast,  in  the  unaffected  twin  the  maternally  inherited  X- 
chromosome  was  inactivated  in  the  majority  of  cells16.  These  data  strongly  imply  that  the 
mutant  allele  causing  DMD  was  maternally  inherited  and  that  non-random  X-inactivation 
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resulted  in  DMD  in  the  affected  twin.  This  phenomenon  is  called  “skewing”  and 
involves  non-random  X-chromosome  inactivation  within  an  individual.  Usually  skewing 
contributes  to  a  lower  percentage  of  females  with  X-linked  diseases15. 

X-chromosome  inactivation  typically  involves  the  majority  of  genes  on  the  chromosome, 
with  the  exception  of  a  class  known  as  “escapers”.  For  the  most  part,  escaper  genes 
neither  share  common  expression  patterns  nor  encode  proteins  with  related  functions. 
Their  chromosomal  positions  cluster,  though,  with  the  majority  mapping  to  the  small  arm 
of  the  X-chromosome,  suggesting  chromosome  domain  organization14.  Whether  these 
genes  escape  inactivation  or  are  reactivated  following  initial  gene  silencing  is  not  clear. 
Recent  data  favor  a  mechanism  whereby  escaper  genes  lack  the  ability  to  maintain 
silencing14. 

Genomic  imprinting 

Imprinting  is  the  process  whereby  a  gene  on  an  autosomal  chromosome  has  a  different 
expression  pattern  depending  on  whether  it  is  inherited  maternally  or  paternally.  This 
mechanism  of  gene  regulation  is  found  in  both  animals  and  plants17.  In  contrast  to  X- 
chromosome  inactivation  that  is  nearly  chromosome-wide,  autosomal  imprinting  only 
affects  a  small  number  of  genes  per  chromosome18.  These  imprinted  genes  are  frequently 
arranged  in  clusters,  such  as  in  the  Beckwith-Wiedemann  syndrome  (BWS)  cluster  that 
spans  one  Mb  of  DNA  on  chromosome  1 1  and  contains  1 5  genes,  nine  of  which  are 
imprinted18,  ,9. 
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In  mammals,  improper  expression  or  mutation  of  imprinted  genes  generally  leads  to 
growth  defects,  impairment  of  brain  function,  and  cancer.  For  example,  BWS  in  humans 
is  associated  with  alterations  in  the  expression  of  the  two  imprinted  genes,  IGF2  and 
CDKN1C  within  the  BWS  cluster  on  chromosome  eleven19.  Misregulation  of  genes 
within  this  cluster  lead  to  developmental  defects  and  a  predisposition  to  Wilm’s  tumor19, 

20 .  IGF2  is  normally  expressed  from  the  paternally  inherited  chromosome;  CDKN1C  is 
normally  expressed  from  the  maternally  inherited  chromosome19.  Uniparental  inheritance 
of  the  BWS  region  from  the  male,  chromosomal  translocations  that  remove  an  allele  of 
either  imprinted  gene,  or  point  mutations  within  the  maternal  CDKN1C  gene  that  disturb 
its  expression  all  lead  to  disruption  of  the  imprinted  pattern,  resulting  in  BWS19. 

Chromatin  structure  regulates  gene  expression 

Silencing  genes  through  PEV,  X-inactivation,  imprinting  and  other  phenomena  is  directly 
linked  to  chromatin  packaging.  In  eukaryotic  cells,  the  genome  is  packaged  into  higher 
order  chromatin  structures,  with  the  smallest  unit  of  chromatin  packaging  being  the 
nucleosome.  The  nucleosome  consists  of  146  ±  2  bp  of  DNA  (1.7  turns  of  the  DNA 
superhelix)  wrapped  around  an  octamer  of  histones  (a  tetramer  of  the  histones  H3/H4  and 
two  dimers  of  the  histones  H2A/H2B  dimer)21,22.  Nucleosomes  wrap  the  DNA,  leaving 
20-80  bp  of  double  stranded  DNA  to  serve  as  a  linker  between  nucleosomes;  the  linker 
size  varies  according  to  the  organism  and  genomic  context23.  The  interaction  between 
DNA  and  the  core  histones  results  in  a  five-fold  compaction  of  the  DNA24.  Nucleosome 
placement  with  respect  to  gene  sequences  is  critical  for  determining  whether  a  gene  will 
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be  expressed.  Nucleosomes  positioned  over  TATA  boxes  and/or  regulatory  elements  can 
block  access  of  RNA  polymerase  and  result  in  gene  repression25.  In  some  cases, 
chromatin-remodeling  machines  are  necessary  to  slide  or  displace  nucleosomes  from 
genes  prior  to  activation26. 

Further  condensation  of  nucleosome-associated  DNA  is  facilitated  by  histone  HI,  which 
binds  linker  DNA,  compacting  chromatin  into  a  30  nm  fiber24.  In  general,  association  of 
HI  correlates  with  gene  repression27.  Packaging  beyond  the  30  nm  fiber  requires 
additional  non-histone  chromosomal  proteins23.  How  these  proteins  generate  higher  order 
chromatin  structures,  such  as  those  observed  near  centromeres,  is  a  topic  under 
investigation. 

In  most  eukaryotic  organisms  the  genome  is  packaged  into  two  general  types  of 
chromatin:  euchromatin  and  heterochromatin.  Euchromatin  contains  the  majority  of 
genes  and  is  packaged  into  irregular  nucleosome  arrays.  Nucleosome-free  regions  are 
detected  as  sites  hypersensitive  to  nucleases  and  usually  map  to  the  regulatory  regions  of 
genes28.  In  contrast,  heterochromatin  contains  large  quantities  of  repetitive  elements  in 
the  genome  and  is  packaged  into  very  regular  nucleosome  arrays28.  Heterochromatic 
regions  are  typically  located  near  centromeres  and  telomeres  and  remain  as  condensed 
chromatin  throughout  the  cell  cycle.  With  these  data  in  mind,  it  is  understandable  how 
placement  of  a  euchromatic  gene  into  heterochromatin  alters  gene  expression,  as 
observed  for  PEV.  The  translocated  gene  appears  to  take  on  the  packaging  state  of  the 
new  genomic  environment. 
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Epigenetic  mechanisms  of  gene  regulation 


Euchromatin  and  heterochromatin  are  not  only  distinguished  by  DNA  sequence  content 
and  nucleosome  array  organization,  but  also  by  modifications  of  amino  acid  residues 
present  in  histones  proteins.  In  particular,  the  amino  terminal  tails  of  histones  that  extend 
from  the  nucleosome  core  are  post-translationally  modified  in  numerous  ways.  Particular 
modifications  have  been  shown  to  correlate  with  specific  functions  (Table  l)29,30.  These 
modifications  serve  as  epigenetic  marks  that  can  be  perpetuated  through  cell  divisions31. 
One  hypothesis  states  that  these  modifications  represent  a  “histone  code”,  serving  as 
docking  sites  for  specific  proteins  that  regulate  chromatin  structure,  gene  expression, 
entry  into  mitosis  and  additional  functions31.  A  second  hypothesis  suggests  that  the 
histone  modifications  regulate  the  levels  of  compaction  and  define  large-scale  domains 
on  chromosomes32.  These  hypotheses  are  not  mutually  exclusive  and  suggest  multiple 
mechanisms  to  control  gene  expression. 

Extensive  analysis  has  been  carried  out  on  the  modifications  that  occur  at  lysine  residues 
within  the  histone  H3  and  H4  amino-terminal  tails.  Active  transcription  is  typically 
associated  with  acetylation  of  lysines  9  and  14,  and  methylation  of  lysine  4  of  histone  H3 
tails29.  These  modifications  are  generated  by  histone  acetyltransferases  (HATs)  and 
histone  methyltransferases,  respectively.  In  contrast,  gene  silencing  is  frequently 
associated  with  hypoacetylation  of  histone  H3  and  H4  tails,  as  well  as  methylation  of 
lysine  9  of  histone  H331.  In  accordance  with  these  “codes”,  euchromatin  and 
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heterochromatin  have  distinct  histone  modifications  corresponding  to  their  relative 
transcriptional  activity. 

Connections  have  been  made  between  histone  modifications  and  another  epigenetic 
regulator  of  gene  expression,  DNA  methylation.  Cytosine  residues  within  the  genome 
can  be  methylated  by  DNA  methyl  transferases.  This  modification  is  typically  associated 
with  gene  silencing.  The  mechanism  of  silencing  involves  the  recruitment  of  methyl- 
DNA  binding  proteins  to  the  methylated  cytosines,  which  in  turn  recruit  histone 
deacetylases  (HDACs)33.  Therefore,  sites  of  DNA  methylation  can  direct  patterns  of 
histone  deacetylation  throughout  the  genome. 

Similar  to  the  targeting  of  histone  deacetylation,  DNA  methylation  has  also  been  linked 
with  histone  methylation  in  Neurospora.  A  genetic  screen  performed  to  identify 
mutations  that  disrupt  DNA  methylation  identified  a  histone  methyltransferase34.  Adding 
further  support  for  this  connection,  the  maintenance  DNA  methyltransferase  mutant 
imetl)  in  Arabidopsis  shows  a  reduction  in  the  amount  of  histone  H3  lysine  9 
methylation35.  Taken  together,  these  data  indicate  that  DNA  methylation  can  be 
dependent  on  histone  methylation  and  vice  versa.  Such  findings  invoke  a  multi-tiered 
regulation  system,  with  at  least  two  components  (histone  and  DNA  methylation)  that  can 
be  mitotically  passed  on  to  ensure  maintenance  of  the  silent  state. 

Methylation  is  one  of  the  most  stable  covalent  histone  modifications  identified  and  is 
thought  to  serve  as  a  “memory”  for  silent  chromatin36.  No  histone  demethylase  has  been 
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discovered  to  date36.  Instead,  removal  of  the  histone  mark  may  occur  through  active 
histone  replacement,  cleavage  of  the  histone  tail,  or  dilution  of  the  histone  methylation 
through  successive  rounds  of  DNA  replication  and  chromatin  packaging37.  Acetylation, 
on  the  other  hand,  is  a  highly  dynamic  histone  modification.  HATs  and  HDACs  alter  the 
state  of  acetylation  of  histones,  leading  to  transcriptional  activation  or  repression, 
respectively37.  Such  features  allow  for  additional  levels  of  gene  regulation. 

Translators  of  the  histone  code 

Bromodomain  proteins 

Support  for  the  histone  code  hypothesis  came  from  discoveries  that  proteins  with  defined 
molecular  functions  recognize  specific  histone  modifications,  thereby  translating  the  code 
into  a  biological  process.  Studies  of  the  bromodomain,  a  conserved  domain  found  in  a 
variety  of  chromatin  associated  proteins,  show  that  it  specifically  interacts  with  acetylated 
lysines  of  histone  H3  and  H438’40.  Bromodomain-containing  proteins,  such  as  GCN5, 
Brahma  and  TAFn250,  mediate  several  important  cellular  functions,  such  as 
transcriptional  activation,  maintenance  of  transcriptional  activation  through  mitosis,  and 
prevention  of  the  spreading  of  silent  chromatin39,40.  For  example,  TAFn250  interacts  with 
the  acetylated  lysine  8  residue  of  histone  H4  and  the  acetylated  lysine  9  and  14  residues 
on  histone  H339.  Association  of  TAF„250  with  acetylated  histones  results  in  the 
recruitment  of  TF„D  and  activation  of  nearby  gene  expression40. 

Acetylated  histone  tails  are  not  the  only  substrate  for  interactions;  bromodomain- 
containing  proteins  also  interact  with  other  proteins  that  contain  acetylated  lysine 
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residues41.  In  many  cases,  these  proteins  are  DNA-binding  transcriptional  activators. 
Interactions  with  bromodomain  containing  proteins  are  hypothesized  to  stabilize  binding 
to  gene  regulatory  regions,  thereby  promoting  gene  expression41. 

Heterochromatin  Protein  1  and  Polvcomb 

Just  as  the  bromodomain  translates  histone  acetylation  into  gene  activation,  the  chromo 
domain  (CD)  translates  histone  methylation  into  gene  silencing.  Chromo  domain 
proteins  such  as  Heterochromatin  Protein  1  (HP1)  and  Polycomb  (Pc)  bind  methylated 
lysine  residues  on  the  histone  H3  tail.  The  genes  encoding  HP1  and  Pc  were  connected 
to  gene  silencing  through  genetic  screens  carried  out  in  Drosophila.  A  screen  for 
modifiers  of  PEV  identified  the  Su(var)2-5  gene  that  encodes  HP142,  and  a  screen  for 
mutations  that  give  rise  to  homeotic  phenotypes  led  to  the  discovery  of  Pc43.  While  HP1 
and  Pc  Group  (PcG)  proteins,  such  as  Polyhomeotic  (Ph),  are  clearly  involved  in  gene 
silencing,  they  localize  to  non-overlapping  sites  in  the  genome,  as  visualized  by 
immunostaining  on  Drosophila  polytene  chromosomes  (Figure  2),  suggesting  distinct 
targets. 

The  amino  acid  sequences  of  HP1  proteins  are  conserved  among  species.  This 
conservation  primarily  resides  within  two  domains,  the  amino-terminal  CD  and  the 
structurally  similar  carboxy-terminal  chromo  shadow  domain  (CSD)44.  The  CD  and  CSD 
are  separated  by  a  poorly  conserved  hinge  region  (Figure  3A).  A  hydrophobic  pocket 
formed  by  the  CD  serves  as  an  interaction  site  for  di-  and  tri-methlyated  lysine  nine 
residues  on  histone  H345,46  (Figure  3C).  This  modification  is  generated  by  the  SET- 
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domain  histone  methyltransferase  SU(VAR)3-9.  Association  with  the  methyl  mark  is 
clearly  the  mechanism  used  for  localization  of  the  majority  of  HP1  to  centric  locations, 
however,  recent  data  suggest  that  HP1  uses  alternate  mechanisms  at  other  genomic  sites47 
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The  hinge  region  is  unstructured  in  solution  and  is  thought  to  enable  the  chromo  and 
chromo  shadow  domains  to  move  independently  of  each  other50,51.  Although  the  hinge 
region  is  unstructured,  it  plays  a  role  in  HP1  function  as  a  nuclear  localization  signal 
required  for  active  transport  of  the  protein  into  the  nucleus52.  In  addition,  the  hinge 
interacts  with  DNA  and  other  nuclear  proteins,  such  as  histone  HI  and  HDACs,  which 
function  in  gene  silencing53'56.  The  hinge  of  HP1  has  also  been  implicated  in  RNA 
binding,  which  appears  to  be  required  for  pericentric  heterochromatin  localization  in 
mammalian  cells57. 

The  CSD  is  structurally  similar  to  the  CD,  also  forming  a  hydrophobic  pocket58.  In 
contrast  to  the  CD,  the  CSD  pocket  has  not  been  found  to  bind  to  modified  histone  tails, 
rather  the  CSD  dimerizes  with  another  CSD  on  a  second  molecule  of  HP1  through  its  a- 
helical  region.  Homodimerization  generates  a  surface  for  interaction  with  a  variety  of 
nuclear  proteins  that  contain  a  pentapeptide  motif,  PxVxL  50,58.  These  proteins  possess  a 
wide  variety  of  functions,  including  chromatin  remodeling,  DNA  replication,  and 
transcription  control50,59'61.  The  role  of  HP1  in  connection  with  many  of  these  proteins  is 
not  well  understood. 
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What  is  the  mechanism  of  HP  1 -mediated  gene  silencing?  Two  approaches  have  been 
taken  to  address  this  question  using  Drosophila.  In  one  approach  a  euchromatic 
transgene  was  mobilized  into  both  euchromatic  and  heterochromatic  regions  of  the 
genome.  Transcription  and  chromatin  structure  analysis  has  been  performed  on  the 
transgenes  in  both  types  of  environments.  Silenced  transgenes  inserted  into 
heterochromatin  appear  less  accessible  to  nuclease  digestion  and  package  into  more 
regular  nucleosome  arrays  than  the  expressed  euchromatic  transgenes62,63.  Lower  doses 
of  HP1  relieved  silencing  of  transgenes  inserted  into  heterochromatin  and  produced  a 
more  open  chromatin  structure64.  Thus,  one  possible  role  of  HP1  in  heterochromatin  is  to 
compact  chromatin,  possibly  through  CSD-CSD  interactions,  preventing  regulatory 
factors  from  associating  with  their  target  sites.  Supporting  this  idea,  high-resolution 
footprint  analyses  revealed  a  lack  of  transcription  factors  and  paused  polymerase  at  the 
promoters  of  transgenes  inserted  into  heterochromatic  regions,  whereas  association  at  the 
promoters  of  transgenes  inserted  into  euchromatin  was  evident25.  Thus,  HP  1 -mediated 
gene  silencing  appears  to  function  at  the  level  of  transcription  initiation  prior  to  the 
recruitment  of  trans-acting  factors  (Figure  4A).  Consistent  with  these  findings,  studies  in 
mammals  show  that  transcription  factors  associated  with  genes  on  the  active  X- 
chromosome  are  absent  from  genes  on  the  Xi65. 

The  second  approach  taken  in  Drosophila  to  address  mechanisms  of  HP  1 -mediated 
silencing  employs  a  tethering  system.  In  this  system,  HP1  was  fused  to  the  DNA  binding 
domain  of  the  LacI  repressor  from  E.  coli66  6y.  The  LacI-HPl  fusion  protein  was 
expressed  in  flies  carrying  lac  operator  repeats  upstream  of  a  reporter  gene.  Association 
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of  LacI-HPl  with  the  lac  operator  repeats  resulted  in  silencing  of  the  reporter  gene  and 
corresponded  to  alterations  in  chromatin  structure,  including  the  generation  of  regular 
nucleosome  arrays66  67.  These  studies  demonstrate  that  HP1  is  sufficient  to  nucleate  the 
formation  of  silent  chromatin,  even  in  the  absence  of  repeat  sequences  typically  found 
within  heterochromatin. 

Polycomb  protein  (Pc)  associates  with  chromatin  in  a  manner  similar  to  that  of  HP1.  The 
chromo  domain  of  Pc  forms  a  hydrophobic  cleft  that  binds  tri-methylated  lysine  27  of 
histone  H3  with  high  affinity  (Figure  3D),  and  tri-methylated  lysine  9  of  histone  H3  with 
lower  affinity68.  Methylation  at  these  residues  is  carried  out  by  E(Z),  a  component  of  the 
Esc-E(Z)  complex69  71.  Based  on  structural  studies,  it  appears  that  Pc  dimerizes  and  that 
two  Pc  molecules  are  unlikely  to  associate  with  one  nucleosome  (i.e.  on  both  H3  tails 
within  a  nucleosome)68.  Therefore,  two  Pc  molecules  associated  with  adjacent 
nucleosomes  might  dimerize  to  compact  chromatin,  bringing  about  gene  silencing68. 

What  is  the  mechanism  of  Pc-mediated  gene  silencing?  To  address  this  question,  a 
Drosophila  transgene  system  was  developed72.  A  reporter  gene  with  an  upstream 
Polycomb  Response  Element  (PRE),  the  cA-element  that  targets  the  Pc  complex,  was 
inserted  into  the  genome.  Association  of  the  Pc  complex  with  the  PRE  led  to  gene 
silencing  and  a  reduction  in  nuclease  accessibility  of  the  transgene  promoter  region72. 
Surprisingly,  chromatin  immunoprecipitation  experiments  revealed  that  trans- acting 
factors  remained  associated  with  regulatory  elements  in  the  transgene  and  that  the  paused 
polymerase  maintained  association  with  the  silenced  promoter.  However,  evidence  of  the 
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polymerase  being  engaged  for  transcription  was  absent72.  Thus,  Pc  silencing  is  likely  to 
block  steps  in  initiation  after  recruitment  of  the  polymerase  (Figure  4B). 


In  summary,  there  are  many  similarities  and  differences  between  HP1-  and  Pc-mediated 
gene  silencing.  Both  HP1  and  Pc  interact  with  chromosomes  through  a  methylated 
residue  of  a  histone  H3  tail,  though  utilizing  different  residues.  Biophysical  and  in  vivo 
localization  studies  support  the  idea  that  both  HP1  and  Pc  homodimerize,  which  might 
allow  for  inter-nucleosomal  interactions58,68.  HP1  and  Pc  both  have  a  conserved  carboxy 
domain,  the  CSD  and  Pc  box,  respectively,  that  interacts  with  a  wide  variety  of  nuclear 
factors48,73  (Figure  3A,  B).  Thus,  HP1  and  Pc  can  be  thought  of  as  bridging  molecules 
that  link  the  chromosome  with  other  factors  required  for  proper  chromosome  dynamics. 
While  HP1  and  Pc  use  a  similar  mechanism  for  localization,  cytological  and  genetic 
studies  demonstrate  that  they  have  different  targets.  In  addition,  their  mechanisms  of 
silencing  appear  to  be  distinct.  HP  1 -mediated  silencing  appears  to  prevent  association  of 
trans- acting  factors,  whereas  Pc-mediated  silencing  allows  association,  but  not  activation 
(Figure  4).  One  explanation  for  this  apparent  difference  in  mechanism  of  silencing  might 
be  a  reflection  of  when  each  protein  acts  in  development.  HP1  is  presumed  to  act  early  in 
embryogenesis  at  the  time  when  heterochromatin  forms,  and  Pc  acts  later  in 
embryogenesis  to  silence  homeotic  gene  expression  at  subsequent  developmental  stages. 
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Connections  between  gene  silencing  and  RNA 


Non-coding  RNAs 

In  recent  years  there  is  overwhelming  support  for  a  role  of  RNAs  in  the  regulation  of 
gene  silencing.  In  this  section  we  will  focus  on  RNAs  that  carry  out  a  role  in  gene 
silencing  by  association  with  the  silenced  gene  itself.  These  RNAs  are  typically  non¬ 
coding  (ncRNAs),  meaning  that  they  lack  protein-encoding  capacity. 

Non-coding  RNAs  play  a  pivotal  role  in  X-chromosome  inactivation.  In  undifferentiated 
cells  X-inactive  specific  transcript  ( Xist )  initiates  from  the  X-inactivation  center  (Xic)11. 
This  17  kb  message  is  spliced,  polyadenylated  and  spreads  along  the  X-chromsome  in 
cis.  In  addition  to  Xist,  Tsix  a  second  ncRNA,  is  transcribed  from  the  opposite  strand 
along  the  length  of  the  Xist  gene74.  At  the  onset  of  differentiation,  Xist  is  exclusively 
expressed  from  the  X-chromosome  selected  for  inactivation,  while  Tsix  is  exclusively 
expressed  from  the  X-chromosome  that  will  remain  active.  Expression  of  Xist  triggers  a 
series  of  epigenetic  modifications  on  the  Xi  that  correlate  with  gene  silencing  (Figure 
5A).  One  of  the  first  events  to  occur  is  the  Xisf-dependent  transient  association  of  the 
Eed-Enxl  complex,  a  homologue  of  the  Polycomb  complex  ESC-E(Z),  that  tri¬ 
methylates  lysine  27  of  histone  H375,76.  Subsequent  histone  modifications  include 
hypoacetylation  of  histone  H3  and  H4  and  methylation  of  lysine  9  of  histone  H377.  As 
predicted  by  the  histone  code,  HP1  is  enriched  on  the  Xi13.  A  late  event  that  accompanies 
stable  expression  of  Xist  is  the  enrichment  of  the  non-replicative  replacement  histone 
variant  macroH2A  and  DNA  methylation.  macroH2A  and  many  of  the  histone 
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modifications  specific  for  the  Xi  form  patterns  of  domains  along  the  length  of  Xi77. 
Interestingly,  domains  on  the  Xi  containing  escaper  genes,  such  as  the  pseudoautosomal 
region,  show  reduced  amounts  of  the  histone  modifications  associated  with  silencing14. 
Taken  together,  the  combination  of  ncRNA,  histone  modifications  and  histone  variants 
might  be  evidence  for  further  complexity  in  the  code,  which  is  translated  into  domains  of 
genes  with  different  levels  of  expression. 

In  addition  to  their  role  in  X-inactivation,  non-coding  RNAs  are  involved  in  genomic 
imprinting.  The  lgf2r  gene  cluster  contains  three  protein-coding  imprinted  genes, 
including  Igflr,  that  are  expressed  when  maternally  inherited78.  Within  this  cluster,  the 
ncRNA  Air  is  only  expressed  when  paternally  inherited.  Air  expression  correlates  with 
silencing  of  all  three  protein-coding  genes.  Transcription  of  Air  originates  from  an  intron 
of  Igf2r  and  is  transcribed  in  the  anti-sense  orientation  along  the  length  of  Igf2r, 
including  the  promoter  region78.  Early  studies  of  Air  suggested  promoter  occlusion  or 
anti-sense  RNA  mechanisms  for  silencing  of  Igf2r,  with  subsequent  spreading  of  silent 
chromatin  over  the  additional  genes  in  the  cluster78.  More  recent  studies  rule  out  these 
possibilities  and  suggest  a  different,  but  related,  mechanism.  Initial  silencing  of  Igf2r  and 
transcriptional  overlap  between  Igf2r  and  Air  is  not  required  for  proper  imprinting79. 

New  models  of  silencing  within  the  Igf2r  gene  cluster  liken  the  role  of  Air  to  Xist, 
suggesting  spreading  in  cis  within  a  domain,  rather  than  along  an  entire  chromosome79. 
Association  of  Air  with  the  lgf2r  cluster  is  accompanied  by  chromatin  modifications, 
such  as  deacetylation  and  histone  methylation80.  However,  differences  between  between 
Air  and  X/sMnediated  silencing  have  already  been  discovered.  Mutations  in  Eed  do  not 
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affect  silencing  at  the  Igf2r  locus81,  suggesting  that  histone  methylation  by  this  Polycomb 
group  complex  is  not  linked  with  this  imprinted  domain  as  it  is  with  X-chromosome 
inactivation. 

RNAi-based  initiation  of  silencing. 

Evidence  of  RNAi  was  first  studied  in  detail  in  C.  elegans.  Injection  of  anti-sense  RNA 
corresponding  to  the  par-1  gene  led  to  knock-down  of  par-1  expression,  as  anticipated82. 
Unexpectedly,  however,  injection  of  the  sense-strand  control  RNA  had  the  same  effect82. 
Furthermore,  injection  of  double  stranded  RNA  (dsRNA)  for  a  given  gene  was  more 
effective  in  silencing  the  gene  than  injection  of  either  strand  alone83.  Components  of  the 
pathway  involved  in  this  phenomenon,  called  RNA  interference  (RNAi),  have  recently 
been  elucidated.  dsRNA  is  recognized  by  the  Dicer  enzyme  that  cleaves  the  RNA  into 
small  interfering  RNA  (siRNA)  molecules  of  approximately  22  nucleotides  in  length84. 
The  siRNA  associates  with  the  RISC  complex  and  is  unwound  into  single  stranded 
antisense  RNA,  activating  the  RISC  complex.  Activated  RISC  associates  with  the  target 
mRNA,  with  the  single  stranded  antisense  RNA  annealing  to  the  region  of  homology, 
causing  cleavage  and  degradation84.  The  outcome  of  this  process,  referred  to  as  RNAi 
results  in  gene  silencing84. 

The  discovery  of  the  RNAi  mediated  gene-silencing  pathway  led  to  the  identification  of 
natural  gene  targets.  In  Drosophila,  tandem  repeated  Stellate  genes,  encoding  a 
regulatory  subunit  for  casein  kinase  II,  reside  in  a  heterochromatic  cluster  on  the  X- 
chromosome85.  The  Stellate  repeats  are  silenced  early  in  Drosophila  development  by 
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sense  and  anti-sense  transcripts  produced  from  homologus  sequences  at  the  Suppressor  of 
Stellate  [Su(ste)]  locus  on  the  Y  chromosome86.  Lack  of  silencing  of  the  Stellate  repeats 
in  male  testes  leads  to  meiotic  abnormalities  and  male  sterility85.  Mutations  in  aubergine 
(aub)  and  homeless  (his),  both  components  of  the  RISC  complex,  relieve  silencing  of  the 
Stellate  repeats85,86,  demonstrating  a  role  for  RNAi  in  heterochromatic  gene  silencing. 

Likewise,  repetitive  sequences  located  near  S.  pombe  centromeres  are  natural  targets  of 
the  RNAi  pathway87.  The  formation  of  silenced  chromatin  encompassing  these  repeats 
occurs  in  a  two-step  process  involving  initiation  and  spreading  (Figure  5B).  Initiation 
requires  processing  of  the  dsRNAs,  from  the  bi-directionally  transcribed  centric  repeats, 
through  the  RNAi  pathway.  These  siRNAs,  in  conjunction  with  RISC,  associate  with 
their  homologous  sequences  within  centric  regions.  Initiation  of  silencing  occurs  through 
the  recruitment  of  histone  modifying  proteins,  resulting  in  the  methylation  of  lysine  9  of 
histone  H3  that  serves  as  a  docking  site  for  Swi6,  the  S.  pombe  HP1  ortholog87. 

Consistent  with  this  process,  mutations  in  the  RNAi  pathway  lead  to  the  loss  of  Swi6  at 
the  centric  repeats87.  Spreading  of  silent  chromatin  is  independent  of  the  RNAi  pathway 
and  is  hypothesized  to  involve  the  histone  code-generating  machinery.  A  similar 
relationship  exists  between  RNAi  and  centric  heterochromatin  in  Drosophila88.  In 
contrast  to  the  post-transcriptional  gene  silencing  of  Stellate  repeats  described  above, 
silencing  near  centromeres  appears  to  occur  at  the  transcriptional  level,  through  the 
formation  of  silent  chromatin  initiated  by  siRNAs. 
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Conclusions 


Now  that  the  genomes  of  many  organisms  have  been  sequenced,  it  is  our  challenge  to 
discover  the  rules  that  regulate  gene  expression.  The  histone  code  appears  to  be  a  good 
candidate  for  predicting  patterns  of  gene  expression.  The  code  continues  to  grow  in 
complexity  with  both  the  identification  of  new  histone  modifications  and  knowledge 
about  how  these  modifications  influence  each  other  and  the  binding  of  non-histone 
chromatin  proteins89,90.  The  histone  code  might  be  initially  established  at  locations 
within  the  genome  by  the  distribution  of  DNA  sequence  elements.  These  elements  attract 
DNA-binding  proteins  that  further  recruit  chromatin-modifying  machinery,  thereby 
setting  up  the  histone  code.  Other  factors  that  might  influence  gene  expression  on  a 
domain  level  include  the  arrangement  of  chromatin  within  the  three-dimensional 
nucleus91.  In  some  cases,  positioning  of  chromosomes  within  the  nucleus  and 
localization  of  specific  RNA,  such  as  Xist,  are  required  for  gene  silencing.  Recent  data 
clearly  demonstrate  a  role  for  RNA  in  the  initiation  of  silent  chromatin.  Beyond 
initiation,  a  common  property  of  silent  chromatin  is  the  ability  to  “spread”  from  the 
initiation  site.  Mechanisms  responsible  for  this  process  are  not  well  defined,  and  are  the 
focus  of  future  studies. 
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Figure  Legends 


Figure  1.  Position  effect  variegation  across  species.  A.  In  Drosophila,  PEV  is 
observed  when  a  chromosomal  translocation  moves  the  white  gene  near  a  centromere. 
This  results  in  a  white  and  red  variegated  eye  phenotype.  Su( var)  mutations  result  in 
increased  expression  of  the  white  gene.  E(var)  mutations  result  in  decreased  expression 
of  the  white  gene.  B.  In  S.  pombe,  PEV  is  observed  when  the  ade6  transgene  is  placed 
near  a  centromere.  ade6  silencing  leads  to  red  pigmentation,  therefore,  PEV  results  in 
red  and  white  sectored  colonies.  C.  In  mouse  cells,  PEV  is  observed  when  a  CD2 
transgene  is  inserted  near  a  centromere.  Variegation  results  in  a  subpopulation  of  the  cells 
lacking  the  CD2  transmembrane  receptor  protein. 

Figure  2.  HP1  and  Pc  proteins  do  not  co-localize  on  Drosophila  polytene 
chromosomes.  Polytene  chromosomes  from  Drosophila  third  instar  larval  salivary 
glands  that  were  squashed  and  stained  with  antibodies  to  HP1  (red)  and  Ph,  a  PcG  protein 
(green).  The  chromocenter,  the  site  of  fusion  of  all  centromeres,  is  labeled  C. 

Figure  3.  Structure  of  HP1  and  Pc.  A.  Diagram  of  the  domain  structure  of  HP1 
protein,  showing  the  chromo  domain,  the  hinge  region,  and  the  chromoshadow  domain. 

B.  Diagram  of  the  domain  structure  of  Pc,  showing  the  chromo  domain  and  the  Pc  box. 

C.  The  chromo  domain  (lKNA.pdb  )  of  HP1  (blue)  forms  a  hydrophobic  pocket  around 
the  di-methylated  lysine  nine  (green)  of  the  histone  H3  peptide  (red).  D.  The  chromo 
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domain  (lPDQ.pdb)  of  Pc  (blue)  forms  a  cleft  that  specifically  interacts  with  tri- 
methylated  lysine  27  (green)  of  the  histone  H3  peptide  (red). 

Figure  4.  Mechanisms  of  HP1  and  Pc  gene  silencing.  A.  One  model  for  HP  1- 
mediated  gene  silencing  involves  HP1  binding  to  methylated  histones  and  forming  a 
regular  array  of  nucleosomes  that  blocks  transcriptional  machinery  from  gaining  access 
to  the  gene  promoter.  B.  One  model  for  Pc-mediated  gene  silencing  involves  the  Pc- 
complex  interacting  with  transcription  factors  at  the  gene  promoter  and  preventing 
engagement  of  the  polymerase. 

Figure  5.  Examples  of  ncRNA  and  RNAi-mediated  gene  silencing.  A.  An  example  of 
ncRNA-based  silencing  is  X-inactivation.  Xist  RNA  (pink  line)  is  expressed  from  the  Xi, 
triggering  a  series  of  histone  and  DNA  modifications  that  accompany  Eed-Enxl  and  HP1 
binding.  B.  In  5.  pombe,  siRNAs  corresponding  to  repetitive  sequences  located  near 
centromeres,  in  association  with  the  RISC  complex,  trigger  initiation  of  heterochromatin. 
Clr4  methylates  lysine  9  of  histone  H3  and  attracts  the  Swi6/HPl  protein. 
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T able  1.  Examples  of  epigenetic  modifications  and  their  consequences  on  nuclear  events 
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Heterochromatin  protein  1  (HP1)  is  a  conserved  chromosomal  protein  enriched  in 
heterochromatic  regions  of  the  genome.  In  humans  there  are  three  HP1  family  members, 
HPlHs",HPlHsp  andHPlHsy.  HP1  proteins  contain  two  conserved  domains,  an  amino 
chromo  domain  that  associates  with  methylated  lysine  9  of  histone  H3,  and  a  carboxy 
chromo  shadow  domain  that  interacts  with  a  variety  of  nuclear  proteins.  These  domains 
are  thought  to  regulate  protein-protein  interactions  that  influence  gene  expression. 

HP  1 Hsa  is  down-regulated  in  invasive/metastatic  breast  cancer  cells  compared  with  poorly 
invasive/non-metastatic  breast  cancer  cells.  Our  working  hypothesis  is  that  HP1  plays  a 
role  in  invasion/metastasis  through  the  regulation  of  gene  expression.  To  address  the  in 
vivo  function  of  HPlHsn  in  human  breast  cancer  (MCF-7)  cells,  we  have  used  two 
methods  for  RNAi  knock-down,  transfection  of  a  pool  of  short,  synthetic  double-stranded 
RNA  molecules,  and  infection  with  an  adenovirus  containing  short  hairpin  RNAi 
sequences  corresponding  to  HPlHsa.  Independently,  both  methods  have  resulted  in  up  to 
99%  HPlHsa  protein  knock-down.  Preliminary  experiments  indicate  the  knock-down  of 
HP1Hsci  is  associated  with  changes  in  cellular  morphology  and  adhesion.  These  changes 
include  a  decrease  in  cell  clustering  and  a  reduction  in  adherence.  Initial  studies  indicate 
differences  in  the  invasive  potential  of  the  knock-down  MCF-7  cells  based  on  in  vitro 
invasion  assays,  implicating  changes  in  adhesion  and  motility.  To  investigate  the 
correlation  between  these  phenotypes  and  the  effects  of  HPlHsa  on  gene  expression,  a 
microarray  analysis  will  be  performed  comparing  cells  with  and  without  HPlHsn  knock¬ 
down.  Collectively,  these  studies  will  elucidate  changes  that  occur  in  chromatin  and  gene 
expression  during  the  transition  in  breast  cancer  associated  with  invasion  and  metastasis. 
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Heterochromatin  protein  1  (HP1)  is  a  conserved  chromosomal  protein  enriched  in 
heterochromatic  regions  of  the  genome,  but  also  localizes  to  sites  within  euchromatin.  In 
humans  there  are  three  HP1  family  members,  HPlHsa,  HPlHsp  andHPlHsy.  HP1  proteins 
contain  two  conserved  domains,  an  amino  chromo  domain  that  associates  with 
methylated  lysine  9  of  histone  H3,  and  a  carboxy  chromo  shadow  domain  that  interacts 
with  a  variety  of  nuclear  proteins.  These  domains  are  thought  to  regulate  protein-protein 
interactions  that  influence  gene  expression.  HPlHsn  is  down-regulated  in 
invasive/metastatic  breast  cancer  cells  compared  with  poorly  invasive/non-metastatic 
breast  cancer  cells.  Our  working  hypothesis  is  that  HP1  plays  a  role  in 
invasion/metastasis  through  the  regulation  of  gene  expression.  To  address  the  in  vivo 
function  of  HP1Hs"  in  human  breast  cancer  (MCF-7)  cells,  we  have  used  two  methods  for 
RNAi  knock-down,  transfection  of  a  pool  of  short,  synthetic  double-stranded  RNA 
molecules,  and  infection  with  an  adenovirus  containing  short  hairpin  RNAi  sequences 
corresponding  to  HP1Hs“.  Independently,  both  methods  have  resulted  in  up  to  99% 
HP1Hs“  protein  knock-down.  Preliminary  experiments  indicate  the  knock-down  of 
HpiHsa  is  associated  with  changes  in  cellular  morphology  and  adhesion.  These  changes 
include  a  decrease  in  cell  clustering  and  a  reduction  in  adherence.  Initial  studies  indicate 
differences  in  the  invasive  potential  of  the  knock-down  MCF-7  cells  based  on  in  vitro 
invasion  assays,  implicating  changes  in  adhesion  and  motility.  To  investigate  the 
correlation  between  these  phenotypes  and  the  effects  of  HPlHsn  on  gene  expression,  a 
microarray  analysis  will  be  performed  comparing  cells  with  and  without  HP1Hs"  knock¬ 
down.  Collectively,  these  studies  will  elucidate  changes  that  occur  in  chromatin  and  gene 
expression  during  the  transition  in  breast  cancer  associated  with  invasion  and  metastasis. 
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Metastasis  is  the  process  by  which  cancerous  cells  break  away  from  a  primary  tumor  and 
establish  a  tumor  at  a  distant  location  within  the  body.  The  five-year  survival  rate  for 
women  with  a  primary  breast  cancer  tumor  is  97%.  In  contrast,  the  five-year  survival 
rate  for  women  with  metastatic  breast  cancer  is  only  23%.  Currently,  there  are  few 
molecular  markers  for  metastasis.  We  have  determined  that  Heterochromatin  protein  lHsa 
(HP1 Hsn)  is  down-regulated  in  highly  invasive/metastatic  breast  cancer  compared  to 
poorly  invasive/non-metastatic  breast  cancer.  HP1  proteins  are  enriched  at 
heterochromatic  regions  of  the  genome  and  play  a  role  in  gene  silencing.  HP1  proteins 
contain  a  chromo  domain  that  interacts  with  methylated  lysine  nine  of  histone  H3  and  a 
chromoshadow  domain  that  homodimerizes  and  interacts  with  a  variety  of  nuclear 
proteins.  We  propose  that  in  non-metastatic  breast  cancer  cells,  HP1 Hsa  functions  to 
silence  genes  that  promote  metastasis.  To  study  the  role  of  HP1Hs"  in  breast  cancer  cells, 
we  introduced  exogenous  HPlHsa  by  adenoviral  infection  into  the  highly  invasive/ 
metastatic  MDA-MB-23 1  breast  cancer  cells.  Expression  of  wild  type  HPlHs<i  in  MDA- 
MB-231  cells  reduces  the  invasive  potential  by  25%.  Expression  of  either  a  nuclear- 
localized  EGFP  or  a  mutant  HPlHsa  (I165E)  that  fails  to  homodimerize  does  not  change 
the  invasive  potential  of  the  MDA-MB-23 1  cells.  In  contrast,  introduction  of  a  mutant 
HPlHsa  (WI74A)  that  fails  to  interact  with  known  protein  partners  containing  a  PxVxL 
motif  leads  to  a  25%  decrease  in  invasion.  Taken  together,  these  data  demonstrate  that 
dimerization,  but  not  interactions  with  proteins  that  contain  a  PxVxL  motif,  is  required 
for  HP1Hs<1  to  suppress  invasion  in  metastatic  breast  cancer  cells.  Future  studies  will 
involve  identification  of  HPlHsra-regulated  genes  required  for  the  suppression  of  invasion. 
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An  estimated  200,000  American  women  are  diagnosed  with  invasive  breast 
cancer  each  year.  The  five-year  survival  rate  for  women  with  primary  breast  cancer  is 
97%,  but  the  rate  is  only  23%  for  women  diagnosed  with  metastatic  breast  cancer. 
Currently,  there  is  a  need  for  molecular  markers  for  metastasis.  We  have  determined  that 
the  chromatin  protein,  HP1Hs<1,  is  down-regulated  in  highly  invasive/metastatic  breast 
cancer  compared  to  poorly  invasive/non-metastatic  breast  cancer.  HP1Hs"  is  one  of  three 
Heterochromatin  Protein  1  (HPl)  family  members  in  mammals.  HP1  proteins  are 
structural  components  of  centric  heterochromatin  and  also  regulate  euchromatic  gene 
expression  through  epigenetic  mechanisms.  HPl  family  members  have  two  domains,  the 
chromo  domain  (CD)  and  the  chromo  shadow  domain  (CSD),  separated  by  a  flexible 
hinge  region.  The  CD  binds  to  methylated  lysine  nine  of  histone  H3,  an  epigenetic  mark 
for  gene  silencing.  The  CSD  homodimerizes  and  generates  a  platform  for  interaction 
with  a  variety  of  nuclear  proteins  containing  a  PxVxL  motif.  We  hypothesize  that 
Hp  j Hsa  functions  as  a  breast  cancer  metastasis  suppressor  through  silencing  genes  that 
promote  invasion  and  metastasis.  We  are  using  the  poorly  invasive/non-metastatic  cell 
line,  MCF-7,  and  the  highly  invasive/metastatic  breast  cancer  cell  line,  MDA-MB-231,  as 
a  model  system  to  study  the  role  of  HPlHsa  in  invasion  and  metastasis.  Introduction  of 
RNAi  constructs  causing  knock-down  of  HPlHsa  in  MCF-7  cells  increases  their  in  vitro 
invasion  by  50%.  Consistent  with  this  finding,  introduction  of  exogenous  HPlHsa  into 
MDA-MB-231  cells  reduces  in  vitro  invasion  by  29%.  An  amino  acid  substitution  in  the 
CSD  (I165E)  that  disrupts  HPlHsn  homodimerization  does  not  suppress  invasion  of 
MDA-MB-231  cells,  but  an  amino  acid  substitution  in  the  CSD  (W174A)  that  disrupts 
the  platform  needed  for  interaction  with  proteins  containing  the  PxVxL  motif  suppresses 
in  vitro  invasion  by  32%.  These  results  indicate  that  HPlHsn  dimerization  is  required  for 
suppression  of  invasion.  The  decrease  in  in  vitro  invasion  caused  by  introduction  of  either 
wild-type  HPlHsa  or  the  W174A  mutant  correlates  with  an  increase  in  levels  of  integrin 
a3  mRNA.  Future  studies  include  the  identification  of  additional  genes  misregulated  by 
altered  HPlHsn  dosage  through  candidate  gene  and  microarray  analyses.  Understanding 
the  regulation  and  function  of  HPlHsrt  in  breast  cancer  invasion  and  metastasis  will 
increase  our  knowledge  of  metastatic  progression  and  may  lead  to  new  ways  to  diagnose 
and  treat  metastasis. 
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The  effects  of  HPlHsa  knock-down  on  breast  cancer  invasion/metastasis.  An 

estimated  200,000  American  women  are  diagnosed  with  invasive  breast  cancer  each  year. 
The  five-year  survival  rate  for  women  with  primary  breast  cancer  is  98%,  but  the  rate  is 
only  26%  for  women  diagnosed  with  metastatic  breast  cancer.  Currently,  the  molecular 
mechanism  of  metastatic  progression  is  not  well  understood.  We  have  determined  that 
heterochromatin  protein  1,  HPlHsa,  is  down-  regulated  in  highly  invasive/metastatic 
breast  cancer  compared  to  poorly  invasive/non-  metastatic  breast  cancer.  HP1  proteins 
are  structural  components  of  centric  heterochromatin  and  also  regulate  gene  expression 
through  epigenetic  mechanisms.  HP1  family  members  have  two  domains,  the  chromo 
domain  (CD)  and  the  chromo  shadow  domain  (CSD),  separated  by  a  flexible  hinge 
region.  The  CD  binds  to  methylated  lysine  nine  of  histone  H3,  an  epigenetic  mark  for 
gene  silencing.  The  CSD  homodimerizes  and  generates  a  platform  for  interaction  with  a 
variety  of  nuclear  proteins  containing  a  PxVxL  motif.  We  hypothesize  that  HPl-Hsalpha 
functions  as  a  breast  cancer  metastasis  suppressor  by  regulating  genes  involved  in 
metastasis.  A  poorly  invasive/non-  metastatic  cell  line,  MCF-7,  was  used  as  models  to 
study  the  role  of  HPlHsa  in  invasion  and  metastasis.  Introduction  of  RNAi  constructs 
causing  knock-down  of  HPlHsa  in  MCF-7  cells  increases  their  in  vitro  invasion  by  50%. 
Current  studies  include  the  identification  of  genes  misregulated  by  alterations  in  dosage 
of  HPl-Hsalpha  through  microarray  and  candidate  gene  analyses.  Understanding  the 
regulation  and  function  of  HPl-Hsalpha  in  breast  cancer  invasion  and  metastasis  will 
increase  our  knowledge  of  metastatic  progression  and  may  lead  to  new  ways  to  diagnose 
and  treat  metastasis. 
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An  estimated  200,000  American  women  are  diagnosed  with  invasive  breast  cancer  each 
year.  The  five-year  survival  rate  for  women  with  primary  breast  cancer  is  97%,  but  the 
rate  is  only  23%  for  women  diagnosed  with  metastatic  breast  cancer.  Currently,  the 
molecular  mechanism  of  metastatic  progression  is  not  well  understood.  We  have 
determined  that  heterochromatin  protein  1,  HPl-Hsalpha,  is  down-regulated  in  highly 
invasive/metastatic  breast  cancer  compared  to  poorly  invasive/non-metastatic  breast 
cancer.  HPl-Hsalpha  is  one  of  three  HP1  family  members  in  mammals.  HP1  proteins 
are  structural  components  of  centric  heterochromatin  and  also  regulate  gene  expression 
through  epigenetic  mechanisms.  HP1  family  members  have  two  domains,  the  chromo 
domain  (CD)  and  the  chromo  shadow  domain  (CSD),  separated  by  a  flexible  hinge 
region.  The  CD  binds  to  methylated  lysine  nine  of  histone  H3,  an  epigenetic  mark  for 
gene  silencing.  The  CSD  homodimerizes  and  generates  a  platform  for  interaction  with  a 
variety  of  nuclear  proteins  containing  a  PxVxL  motif.  We  hypothesize  that  HPl-Hsalpha 
functions  as  a  breast  cancer  metastasis  suppressor  by  regulating  genes  involved  in 
metastasis.  A  highly  invasive/metastatic  breast  cancer  cell  line,  MDA-MB-231,  and  a 
poorly  invasive/non-metastatic  cell  line,  MCF-7,  were  used  as  models  to  study  the  role  of 
HPl-Hsalpha  in  invasion  and  metastasis.  Reduced  levels  of  HPl-Hsalpha  in  MDA-MB- 
231  cells  is  partially  due  to  transcriptional  down-regulation  by  an  E-box  regulatory 
element  within  the  promoter  region.  Introduction  of  exogenous  HPl-Hsalpha  into  MDA- 
MB-231  cells  reduces  in  vitro  invasion  by  29%.  An  amino  acid  substitution  in  the  CSD 
(I165E)  that  disrupts  HPl-Hsalpha  homodimerization  does  not  suppress  invasion  of 
MDA-MB-231  cells,  but  an  amino  acid  substitution  in  the  CSD  (W174A)  that  disrupts 
the  platform  needed  for  interaction  with  proteins  containing  the  PxVxL  motif  suppresses 
in  vitro  invasion  by  32%.  These  results  indicate  that  HPl-Hsalpha  dimerization  is 
required  for  suppression  of  invasion.  Consistent  with  these  findings,  introduction  of 
RNAi  constructs  causing  knock-down  of  HPl-Hsalpha  in  MCF-7  cells  increases  their  in 
vitro  invasion  by  50%.  Current  studies  include  the  identification  of  genes  misregulated 
by  alterations  in  dosage  of  HPl-Hsalpha  through  microarray  and  candidate  gene  analyses. 
Understanding  the  regulation  and  function  of  HPl-Hsalpha  in  breast  cancer  invasion  and 
metastasis  will  increase  our  knowledge  of  metastatic  progression  and  may  lead  to  new 
ways  to  diagnose  and  treat  metastasis. 


